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Description 

Technical Field 

This invention is in the fields of genetic engineering, plant biology, and bacteriology. 



Background Art 

In the past decade, the science of genetic engineering has developed rapidly. A variety of processes 
are known for inserting a heterologous gene into bacteria, whereby the bacteria become capable of efficient 
expression of the inserted genes. Such processes normally involve the use of plasmids which may be 
cleaved at one or more selected cleavage sites by restriction endonucleases, discussed below. Typically, a 
gene of interest is obtained by cleaving one piece of DNA and the resulting DNA fragment is mixed with a 
fragment obtained by cleaving a vector such as a plasmid. The different strands of DNA are then connected 
("ligated") to each other to form a reconstituted plasmid. See, for example, U.S. Patents 4,237,224 (Cohen 
and Boyer, 1980); 4,264,731 (Shine, 1981); 4,273,875 (Manis, 1981); 4,322,499 (Baxter et al, 1982), and 
4,336,336 (Silhavy et al, 1982). A variety of other reference works are also available. Some of these works 
describe the natural processes whereby DNA is transcribed into messenger RNA (mRNA) and mRNA is 
translated into protein; see, e.g., Stryer, 1981 (note: all references cited herein, other than patents, are listed 
with citations after the Examples); Lehninger, 1975. Other works describe methods and products of genetic 
manipulation; see, e.g., Maniatis et al, 1982; Setlow and Hollaender, 1979. 

Most of the genetic engineering work performed to date involves the insertion of genes into various 
types of cells, primarily bacteria such as E. coli, various other types of microorganisms such as yeast, and 
mammalian cells. However, many of the techniques and substances used for genetic engineering of animal 
cells and microorganisms are not directly applicable to genetic engineering involving plants. 

As used herein, the term "plant" refers to a multicellular differentiated organism that is capable of 
photosynthesis, such as angiosperms and multicellular algae. This does not include microorganisms, such 
as bacteria, yeast, and fungi. However, the term "plant cells" includes any cell derived from a plant; this 
includes undifferentiated tissue such as callus or crown gall tumor, as well as plant seeds, propagules, 
pollen; and plant embryos. 

A variety of plant genes have been isolated, some of which have been published and/or are publicly 
available. Such genes include the soybean actin gene (Shah et al 1982), corn zein (Pederson et al, 1982) 
soybean leghemoglobin (Hyldig-Nielsen et al, 1982), and soybean storage proteins (Fischer and Goldberg, 
1982). 



The Regions of a Gene 

The expression of a gene involves the creation of a polypeptide which is coded for by the gene. This 
process involves at least two steps: part of the gene is transcribed to form messenger RNA, and part of the 
mRNA is translated into a polypeptide. Although the processes of transcription and translation are not fully 
understood, it is believed that the transcription of a DNA sequence into mRNA is controlled by several 
regions of DNA. Each region is a series of bases (i.e., a series of nucleotide residues comprising adenosine 
(A), thymidine (T), cytidine (C), and guanidine (G)) which are in a desired sequence. Regions which are 
usually present in a eucaryotic gene are shown on Figure 1. These regions have been assigned names for 
use herein, and are briefly discussed below. It should be noted that a variety of terms are used in the 
literature, which describes these regions in much more detail. 

An association region 2 causes RNA polymerase to associate with the segment of DNA. Transcription 
does not occur at association region 2; instead, the RNA polymerase normally travels along an intervening 
region 4 for an appropriate distance, such as about 100-300 bases, after it is activated by association region 
2. 

A transcription initiation sequence 6 directs the RNA polymerase to begin synthesis of mRNA. After it 
recognizes the appropriate signal, the RNA polymerase is believed to begin the synthesis of mRNA an 
appropriate distance, such as about 20 to about 30 bases, beyond the transcription initiation sequence 6. 
This is represented in Figure 1 by intervening region 8. 

The foregoing sequences are referred to collectively as the promoter region of the gene. 

The next sequence of DNA is transcribed by RNA polymerase into messenger RNA which is not 
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translated into protein. In general, the 5' end of a strand of mRNA attaches to a ribosome. In bacterial cells, 
this attachment is facilitated by a sequence of bases called a "ribosome binding site" (RBS). However, in 
eucaryotic cells, no such RBS sequence is known to exist. Regardless of whether an RBS exists in a strand 
of mRNA, the mRNA moves through the ribosome until a "start codon" is encountered. The start codon is 

5 usually the series of three bases, AUG; rarely, the codon GUG may cause the initiation of translation. The 
non-translated portion of mRNA located between the 5' end of the mRNA and the start codon is referred to 
as the 5' non-translated region 10 of the mRNA. The corresponding sequence in the DNA is also referred to 
herein as 5' non-translated region 12. The specific series of bases in this sequence is not believed to be of 
great importance to the expression of the gene; however, the presence of a premature start codon might 

10 affect the translation of the mRNA (see Kozak, 1978). 

A promoter sequence may be significantly more complex than described above; for example, certain 
promoters present in bacteria contain regulatory sequences that are often referred to as "operators." Such 
complex promoters may contain one or more sequences which are involved in induction or repression of 
the gene. One example is the lac operon, which normally does not promote transcription of certain lactose- 

75 utilizing enzymes unless lactose is present in the ceil. Another example is the trp operator, which does not 
promote transcription or translation of certain tryptophan-creating enzymes if an excess of tryptophan is 
present in the cell. See, e.g., Miller and Reznikoff, 1982. 

The next sequence of bases is usually called the coding sequence or the structural sequence 14 (in the 
DNA molecule) or 16 (in the mRNA molecule). As mentioned above, the translation of a polypeptide begins 

20 when the mRNA start codon, usually AUG, reaches the translation mechanism in the ribosome. The start 
codon directs the ribosome to begin connecting a series of amino acids to each other by peptide bonds to 
form a polypeptide, starting with methionine, which always forms the amino terminal end of the polypeptide 
(the methionine residue may be subsequently removed from the polypeptide by other enzymes). The bases 
which follow the AUG start codon are divided into sets of 3, each of which is a codon. The "reading frame", 

25 which specifies how the bases are grouped together into sets of 3, is determined by the start codon. Each 
codon codes for the addition of a specific amino acid to the polypeptide being formed. The entire genetic 
code (there are 64 different codons) has been solved; see, e.g., Lehninger, supra, at p. 962. For example, 
CUA is the codon for the amino acid leucine; GGU specifies glycine, and UGU specifies cysteine. 

Three of the codons (UAA, UAG, and UGA) are "stop" codons; when a stop codon reaches the 

30 translation mechanism of a ribosome, the polypeptide that was being formed disengages from the 
ribosome, and the last preceding amino acid residue becomes the carboxy terminal end of the polypeptide. 

The region of mRNA which is located on the 3' side of a stop codon in a monocistronic gene is referred 
to herein as 3' non-translated region 18. This region 18 is believed to be involved in the processing, 
stability, and/o7 transport of the mRNA after it is transcribed. This region 18 is also believed to contain a 

35 sequence of bases, poly-adenylation signal 20, which is recognized by an enzyme in the cell. This enzyme 
adds a substantial number of adenosine residues to the mRNA molecule, to form poly-A tail 22. 

The DNA molecule has a 3" non-translated region 24 and a poly-adenylation signal 26, which code for 
the corresponding mRNA regio~n 18 and signal 20. However, the DNA molecule does not have a poly-A tail. 
Poly-adenylation signals 20 (mRNA) and 26 (DNA) are represented in the figures by a heavy dot. 

40 

Gene-Host Incompatibility 

The same genetic code is utilized by all living organisms on Earth. Plants, animals, and microorganisms 
45 all utilize the same correspondence between codons and amino acids. However, the genetic code applies 
only to the structural sequence of a gene, i.e., the segment of mRNA bounded by one start codon and one 
stop codon which codes for the translation of mRNA into polypeptides. 

However, a gene which performs efficiently in one type of cell may not perform at all in a different type 
of cell. For example, a gene which is expressed in E. coli may be transferred into a different type of 
so bacterial cell, a fungus, or a yeast. However, the gene might not be expressed in the new host cell. There 
are numerous reasons why an intact gene which is expressed in one type of cell might not be expressed in 
a different type of cell. See, e.g., Sakaguchi and Okanishi, 1981. Such reasons include: 

1. the gene might not be replicated or stably inherited by the progeny of the new host cell. 

2. the gene might be broken apart by restriction endonucleases or other enzymes in the new host cell. 
55 3. the promoter region of the gene might not be recognized by the RNA polymerases in the new host 

cell. 

4. one or more regions of the gene might be bound by a repressor protein or other molecule in the new 
host cell, because of a DNA region which resembles an operator or other regulatory sequence of the host's 
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DNA. For example, the lac operon includes a polypeptide which binds to a particular sequence of bases 
next to the lac promoter unless the polypeptide is itself inactivated by -lactose. See, e.g., Miller and 
Reznikoff, 1982. 

5. one or more regions of the gene might be deleted, reorganized, or relocated to a different part of the 
5 host's genome. For example, numerous procaryotic cells are known to contain enzymes which promote 
genetic recombination; (such as the rec proteins in E. coli; see, e.g., Shibata et al, 1979) and transposition 
(see, e.g., The 45th Cold Spring Harbor SymposiurrTon Quantitative Biology, 1981). In addition, naturally- 
occurring genetic modification can be enhanced by regions of homology between different strands of DNA; 
see, e.g., Radding, 1978. 

70 6. mRNA transcribed from the gene may suffer from a variety of problems. For example, it might be 
degraded before it reaches the ribosome, or it might not be poly-adenylated or transported to the ribosome, 
or it might not interact properly with the ribosome, or it might contain an essential sequence which is 
deleted by RNA processing enzymes. 

7. the polypeptide which is created by translation of the mRNA coded for by the gene may suffer from 

75 a variety of problems. For example, the polypeptide may have a toxic effect on the cell, or it may be 
glycosylated or converted into an altered polypeptide, or it may be cleaved into shorter polypeptides or 
amino acids, or it may be sequestered within an intracellular compartment where it is not functional. 

In general, the likelihood of a foreign gene being expressed in a cell tends to be lower if the new host 
cell is substantially different from the natural host cell. For example, a gene from a certain species of 

20 bacteria is likely to be expressed by other species of bacteria within the same genus. The gene is less 
likely to be expressed by bacteria of a different genus, and even less likely to be expressed by non- 
bacterial microorganisms such as yeast, fungus, or algae. It is very unlikely that a gene from a cell of one 
kingdom (the three kingdoms are plants, animals, and "protista" (microorganisms)) could be expressed in 
cells from either other kingdom. 

25 These and other problems have, until now, thwarted efforts to obtain expression of foreign genes into 
plant cells. For example, several research teams have reported the insertion of foreign DNA into plant cells; 
see, e.g., Lurquin, 1979; Krens et al, 1982; Davey et al, 1980. At least three teams of researchers have 
reported the insertion of entire genes into plant cells. By use of radioactive DNA probes, these researchers 
have reported that the foreign genes (or at least portions thereof) were stably inherited by the descendants 

30 of the plant cells. See Hemalsteens et al, 1980; Qarfinkel et al, 1981; Matzke and Chilton, 1981. However, 
there was no reported evidence that the foreign genes were expressed in the plant cells. 

Several natural exceptions to the gene-host incompatibility barriers have been discovered. For example, 
several E. coli genes can be expressed in certain types of yeast cells, and vice-versa. See Beggs, 1978; 
Struhl et al, 1979. 

35 In addition, certain types of bacterial cells, including Agrobacterium tumefaciens and A. rhizogenes, are 
capable of infecting various types of plant cells, causing plant diseases such as crown gaT tumor and hairy 
root disease. These Agrobacterium cells carry plasmids, designated as Ti plasmids and Ri plasmids, which 
carry genes which are expressed in plant cells. Certain of these genes code for enzymes which create 
substances called "opines," such as octopine, nopaline, and agropine. Opines are utilized by the bacteria 

40 cells as sources of carbon, nitrogen, and energy. See, e.g., Petit and Tempe, 1978. The opine genes are 
believed to be inactive while in the bacterial cells; these genes are expressed only after they enter the plant 
cells. 

In addition, a variety of man-made efforts have been reported to overcome one or more of the gene- 
host incompatibility barriers. For example, it has been reported that a mammalian polypeptide which is 
45 normally degraded within a bacterial host can be protected from degradation by coupling the mammalian 
polypeptide to a bacterial polypeptide that normally exists in the host cell. This creates a "fusion protein;" 
see, e.g., Itakura et al, 1977. As another example, in order to avoid cleavage of an inserted gene by 
endonucleases in the host cell, it is possible to either (1) insert the gene into host cells which are deficient 
in one or more endonucleases, or (2) duplicate the gene in cells which cause the gene to be methylated. 



so See, e.g., Maniatis et al, 1981. 

In addition, various efforts to overcome gene-host incompatibility barriers involve chimeric genes. For 
example, a structural sequence which codes for a mammalian polypeptide, such as insulin, interferon, or 
growth hormone, may be coupled to regulatory sequences from a bacterial gene. The resulting chimeric 
gene may be inserted into bacterial cells, where it will express the mammalian polypeptide. See, e.g., 

55 Guarente et al, 1980. Alternately, structural sequences from several bacterial genes have been coupled to 
regulatory sequences from viruses which are capable of infecting mammalian cells. The resulting chimeric 
genes were inserted into mammalian cells, where they reportedly expressed the bacterial polypeptide. See, 
e.g., Southern and Berg, 1982; Colbere-Garapin et al, 1982. 
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Restriction Endonucleases 

In general, an endonuclease is an enzyme which is capable of breaking DNA into segments of DNA. An 
endo nuclease is capable of attaching to a strand of DNA somewhere in the middle of the strand, and 
5 breaking it. By comparison, an exo nuclease removes nucleotides from the end of a strand of DNA. All of 
the endonucleases discussed herein are capable of breaking double-stranded DNA into segments. This may 
require the breakage of two types of bonds: (1) covalent bonds between phosphate groups and deoxyribose 
residues, and (2) hydrogen bonds (A-T and C-Q) which hold the two strands of DNA to each other. 

A "restriction endonuclease" (hereafter referred to as an endonuclease) breaks a segment of DNA at a 
jo precise sequence of bases. For example, EcoRI and Haelll recognize and cleave the following sequences: 



ECORI : 



ATTC 

ctta3g-5» 



YYCTTAA 



AATTCXX 
GYY 



XXGG 
YYCC 



CCXX 
GGYY 



In the examples cited above, the EcoRI cleavage created a "cohesive" end with a 5' overhang (i.e., the 
single-stranded "tail" has a 5' end rather than a 3' end). Cohesive ends can be useful in promoting desired 
ligations. For example, an EcoRI end is more likely to anneal to another EcoRI end than to a Haelll end. 

Over 100 different endonucleases are known, each of which is capable of cleaving DNA at specific 
sequences. See, e.g., Roberts, 1982. All restriction endonucleases are sensitive to the sequence of bases. 
In addition, some endonucleases are sensitive to whether certain bases have been methylated. For 
example, two endonucleases, Mbol and Sau3a are capable of cleaving the following sequence of bases as 
shown: 



5-XXfeATCXX 

YYCTA 



5»-XX 

YYCTAG 



GATCXX 
YY 



Mbol cannot cleave this sequence if the adenine residue is methylated (me-A). Sau3a can cleave this 
sequence, regardless of whether either A is methylated. To some extent the methylation (and therefore the 
45 cleavage) of a plasmid may be controlled by replicating the plasmids in cells with desired methylation 
capabilities. An E. coli enzyme, DNA adenine methylase (dam), methylates the A residues that occur in 
GATC sequences? Strains of E. coli which do not contain the dam enzyme are designated as dam-ceils. 
Cells which contain dam are designated as dam + or dam cells. 

Several endonucleases are known which cleave different sequences, but which create cohesive ends 
50 which are fully compatible with cohesive ends created by other endonucleases. For example, at least five 
different endonucleases create 5 ! GATC overhangs, as shown in Table 1 . 



55 
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Table 1 



Endonuclgasg 
Mbol 

Inhibited by me-A 

Sau3a 
Unaffected by me-A 



Sequence 




Bglll 
Unaffected by me-A 

Bell 

Inhibited by me-A 



BamHI 
Unaffected by me-A 



A cohesive end created by any of the enconucleases listed in Table 1 will ligate preferentially to a 
cohesive end created by any of the other endonucleases. However, a ligation of, for example, a Bglll end 
with a BamHI end will create the following sequence: 

AGATCC 
TCTAGG 



This sequence cannot be cleaved by either Bgl 
(unless methylated) or by Sau3a. 

Another endonuclease which involves the GATC sequence is Pvul, which creates 
follows: 



BamHI; however, it can be cleaved by Mbol 
overhang, as 



Another endonuclease, Clal, cleaves the following sequence: 
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X 1 AIgGATX 2 

y tagcJtay 

If Xi is G, or if X2 is C, then the sequence may be cleaved by Mbol (unless methylated, in which case 
Clal is also inhibited) or Sau3a. 



Viral Promoters 

A virus is a microorganism comprising single or double stranded nucleic acid (DNA or RNA) contained 
within a protein (and possibly lipid) shell called a "capsid" or "coat". A virus is smaller than a cell, and it 
does not contain most of the components and substances necessary to conduct most biochemical 
processes. Instead, a virus infects a cell and uses the cellular processes to reproduce itself. 

The following is a simplified description of how a DNA-containing virus infects a cell; RNA viruses will 
be disregarded in this introduction for the sake of clarity. First, a virus attached to or enters a cell; normally 
called a "host" cell. The DNA from the virus (and possible the entire viral particle) enters the host cell 
where it usually operates as a plasmid (a loop of extra-chromosomal DNA). The viral DNA is transcribed 
into messenger RNA, which is translated into one or more polypeptides. Some of these polypeptides are 
assembled into new capsids, while others act as enzymes to catalyze various biochemical reactions. The 
viral DNA is also replicated and assembled with the capsid polypeptides to form new viral particles. These 
viral particles may be released gradually by the host cell, or they may cause the host cell to lyse and 
release them. The released viral particles subsequently infect new host cells. For more background 
information on viruses see, e.g., Stryer, 1981 and Matthews, 1970. 

As used herein, the term "virus" includes phages and viroids, as well as replicative intermediates. As 
used herein, the phrases "viral nucleic acid" and "DNA or RNA derived from a virus" are construed broadly 
to include any DNA or RNA that is obtained or derived from the nucleic acid of a virus. For example, a DNA 
strand created by using a viral RNA strand as a template, or by chemical synthesis to create a known 
sequence of bases determined by analyzing viral DNA, would be regarded as viral nucleic acid. 

The host range of any virus (i.e., the variety of cells that a type of virus is capable of infecting) is 
limited. Some viruses are capable of efficient infection of only certain types of bacteria; other viruses can 
infect only plants, and may be limited to certain genera; some viruses can infect only mammalian cells. 
Viral infection of a cell requires more than mere entry of the viral DNA or RNA into the host cell; viral 
particles must be reproduced within the cell. Through various assays, those skilled in the art can readily 
determine whether any particular type of virus is capable of infecting any particular genus, species, or strain 
of cells. As used herein, the term "plant virus" is used to designate a virus which is capable of infecting one 
or more types of plant cells, regardless of whether it can infect other types of cells. 

With the possible exception of viroids (which are poorly understood at present), every viral particle must 
contain at least one gene which can be "expressed" in infected host cells. The expression of a gene 
requires that a segment of DNA or RNA must be transcribed into or function as a strand of messenger RNA 
(mRNA), and the mRNA must be translated into a polypeptide. Most viruses have about 5 to 10 different 
genes, all of which are expressed in a suitable host cell. 

Promoters from viral genes have been utilized in a variety of genetic engineering applications. For 
example, chimeric genes have been constructed using various structural sequences (also called coding 
sequences) taken from bacterial genes, coupled to promoters taken from viruses which can infect 
mammalian cells (the most commonly used mammalian viruses are designated as Simian Virus 40 (SV40) 
and Herpes Simplex Virus (HSV)). These chimeric genes have been used to transform mammalian cells. 
See, e.g., Mulligan et al 1979; Southern and Berg 1982. In addition, chimeric genes using promoters taken 
from viruses which can infect bacterial cells have been used to transform bacterial cells; see, e.g., the 
phage lambda PL promoter discussed in Maniatis et al, 1982. 

Several researchers have theorized that it might be possible to utilize plant viruses as vectors for 
transforming plant cells. See, e.g., Hohn et al, 1982. In general, a "vector" is a DNA molecule useful for 
transferring one or more genes into a cell. Usually, a desired gene is inserted into a vector, and the vector 
is then used to infect the host cell. 

Several researchers have theorized that it might be possible to create chimeric genes which are 
capable of being expressed in plant cells, by using promoters derived from plant virus genes. See, e.g., 
Hohn et al, 1982, at page 216. 
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However, despite the efforts of numerous research teams, prior to this invention no one had succeeded 
in (1) creating a chimeric gene comprising a plant virus promoter coupled to a heterologous structural 
sequence and (2) demonstrating the expression of such a gene in any type of plant cell. 



Cauliflower Mosaic Virus (CaMV) 

The entire DNA sequence of CaMV has been published. Gardner et al, 1981; Hohn et al, 1982. In its 
most common form, the CaMV genome is about 8000 bp long. However, various naturally occurring 

io infective mutants which have deleted about 500 bp have been discovered; see Howarth et al 1981. The 
entire CaMV genome is transcribed into a single mRNA, with a sedimentation coefficient of 32S. The 
promoter for the 32S mRNA is located in the large intergenic region about 1 kb counterclockwise from Gap 
1 (see Guilley etal, 1982). 

CaMV is believed to generate at least eight proteins; the corresponding genes are designated as Genes 

r5 I through VIII. Gene VI is transcribed into mRNA with a sedimentation coefficient of 195. The 19S mRNA is 
transcribed into a protein designated as P66, which is an inclusion body protein. The 19S mRNA is 
promoted by the 19S promoter, located about 2.5 kb counterclockwise of Gap 1. 



20 SUMMARY OF THE INVENTION 

This invention relates to chimeric genes which are capable of being expressed in plant cells, and to a 
method for creating such genes. 

The chimeric gene comprises a promoter region which is capable of causing RNA polymerase in a 

25 plant cell to create messenger RNA corresponding to the DNA. One such promoter region comprises a 
nopaline synthase (NOS) promoter region, which normally exists in certain types of Ti plasmids in bacteria, 
A. tumefaciens. The NOS gene normally is inactive while contained in A. tumefaciens cells, and it becomes 
active after the Ti plasmid enters a plant cell. Two other promoters have been derived from the cauliflower 
mosaic virus (CaMV). Other suitable promoter regions may be derived from genes which exist naturally in 

30 plant cells, or from viruses which are capable of infecting plant cells. 

The chimeric gene also contains a sequence of bases which codes for a 5' non-translated region of 
mRNA which is capable of enabling or increasing the expression in a plant cell of a structural sequence of 
the mRNA. For example, a suitable 5' non-translated region may be taken from the NOS gene, from a plant 
virus gene, or from a gene which exists naturally in plant cells. 

35 The chimeric gene also contains a desired structural sequence, i.e., a sequence which is transcribed 
into mRNA which is capable of being translated into a desired polypeptide. The structural sequence is 
heterologous with respect to the promoter region, and it may code for any desired polypeptide, such as a 
bacterial or mammalian protein. The structural sequence includes a start codon and a stop codon. The 
structural sequence may contain introns which are removed from the mRNA prior to translation. 

40 The chimeric gene also contains a DNA sequence which codes for a 3' non-translated region (including 
a poly-adenylation signal) of mRNA. This region may be derived from a gene which is naturally expressed 
in plant cells, to help ensure proper expression of the structural sequence. Such genes include the NOS 
gene, plant virus genes, and genes which exist naturally in plant cells. 

The method of this invention is described below, and is summarized in the flow chart of Figure 2. 

45 If properly assembled and inserted into a plant genome, a chimeric gene of this invention will be 
expressed in the plant cell to create a desired polypeptide, such as a mammalian hormone, or a bacterial 
enzyme which confers antibiotic or herbicide resistance upon the plant. 



so Brief Description of the Drawings 



The figures herein are schematic representations; they have not been drawn to scale. 
Figure ~\_ represents the structure of a typical eukaryotic gene. 

Figure 2 is a flow chart representing the steps of this invention, correlated with an example chimeric 
55 NOS-NPTII-NOS gene. 

Figure 3 represents fragment Hindlll-23, obtained by digesting a Ti plasmid with Hindlll. 
Figure 4 represents a DNA fragment which contains a NOS promoter region, a NOS 5' non-translated 
region, and the first few codons of the NOS structural sequence. 



10 



EP 0 131 623 B1 



Figure 5 represents the cleavage of a DNA sequence at a precise location, to obtain a DNA fragment 
which contains a NOS promoter region and complete 5' non-translated region. 

Figure 6 represents the creation of plasmids pMON1001 and pMON40, which contain an NPTII 
structural sequence. 

Figure 7 represents the insertion of a NOS promoter region into plasmid pMON40, to obtain pMON58. 

Figure 8 represents the creation of an M13 derivative designated as M-2, which contains a NOS 3' non- 
translated region and poly-A signal. 

Figure 9 represents the assembly of the NOS-NPTII-NOS chimeric gene, and the insertion of the 
chimeric gene into plasmid pMON38 to obtain plasmids pMON75 and pMON76. 

Figure 10 represents the insertion of the NOS-NPTII-NOS chimeric gene into plasmid pMON120 to 
obtain plasmids pMON128 and pMON129. 

Figure 11 represents the creation of plasmid pMON66, which contains an NPTI gene. 

Figure T2 represents the creation of plasmid pMON73, containing a chimeric NOS-NPTII sequence. 

Figure T3 represents the creation of plasmid pMON78, containing a chimeric NOS-NPTI sequence. 

Figure 14 represents the creation of plasmids pMON106 and pMON107, which contain chimeric NOS- 
NPTTNOS genes. 

Figure 15 represents the insertion of a chimeric NOS-NPTI-NOS gene into pMON120 to obtain 
plasmids pMON130 and pMON131. 

Figure 16 represents the structure of a DNA fragment containing a soybean protein (sbss) promoter. 

Figure 17 represents the creation of plasmid pMON121, containing the sbss promoter. 

Figure T8 represents the insertion of a chimeric sbss-NPTII-NOS gene into pMON120 to create 
plasmids pMON141 and pMON142. 

Figure 19 represents the creation of plasmid pMON108, containing a bovine growth hormone structural 
sequence and a NOS 3' region. 

Figure 20 represents the creation of plasmid N25-BGH, which contains the BGH-NOS sequence 
surrounded by selected cleavage sites. 

Figure 21 represents the insertion of a chimeric sbss-BGH-NOS gene into pMON120 to obtain plasmids 
pMON147 and pMON148. 

Figure 22 represents the creation of plasmid pMON149, which contains a chimeric NOS-BGH-NOS 
gene^ 

Figure 23 represents the creation of plasmid pMON8, which contains a structural sequence for EPSP 
synthase. 

Figure 24 represents the creation of plasmid pMON25, which contains an EPSP synthase structural 
sequence with several cleavage sites near the start codon. 

Figure 25 represents the creation of plasmid pMON146, which contains a chimeric sequence compris- 
ing EPSP synthase and a NOS 3' region. 

Figure 26 represents the insertion of a chimeric NOS-EPSP-NOS gene into pMON120 to obtain plasmid 
pMON153. _ 

Figure 27 represents the creation of plasmid pMON154, which contains a chimeric sbss-EPSP-NOS 
gene. 

Figure 28 represents the creation and structure of plasmid pMON93, which contains a CaMV 19S 
promoter. 

Figure 29 represents the creation and structure of plasmid pMON156, which contains a chimeric CaMV- 
(19S)-NPT-NOS gene. 

Figure 30 represents the creation and structure of plasmid pMON110, which contains a partial NPT 
gene. 

Figure 31 represents the creation and structure of plasmid pMON132, which contains a partial NPT- 
NOS gene. - 

Figure 32 represents the creation and structure of plasmid pMON155, which contains a chimeric CaMV- 
(19SyNPT-NOS gene. 

Figure 33 represents the creation and structure of plasmid pDMON81, which contains a CaMV 32S 
promoter. 

Figure 34 represents the creation and structure of plasmid pMON125, which contains a CaMV 32S 
promoter. 

Figure 35 represents the creation and structure of plasmid pMON172, which contains a CaMV 32S 
promoter. 

Figure 36 represents the creation and structure of phage M12, which contains a CaMV 32S promoter. 
Figure 3*7 represents the creation and structure of plasmids pMON183 and pMON184, which contain 
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chimeric CaMV(32S)-NPT-NOS genes. 



DETAILED DESCRIPTION OF THE INVENTION 

In one preferred embodiment of this invention, a chimeric gene was created which contained the 
following elements: 

1 . a promoter region and a 5' non-translated region derived from a nopaline synthase (NOS) gene; 

2. a structural sequence derived from a neomycin phosphotransferase II (NPTII or NPT II) gene; and, 
to 3. a 3' non-translated region, including a poly-adeny!ation signal, derived from a NOS gene. 

This chimeric gene, referred to herein as a NOS-NPTII-NOS gene, was assembled and inserted into a 
variety of plant cells, causing them to become resistant to aminoglycoside antibiotics such as kanamycin. 

The method used to assemble this chimeric gene is summarized in the flow chart of Figure 2, and 
described in detail below and in the examples. To assist the reader in understanding the steps of this 

75 method, various plasmids and fragments involved in the NOS-NPTll-NOS chimeric gene are cited in 
parentheses in Figure 2. However, the method of Figure 2 is applicable to a wide variety of other plasmids 
and fragments. To further assist the reader, the steps shown in Figure 2 have been assigned callout 
numbers 42 et seq. These callout numbers are cited in the following description. The techniques and DNA 
sequences of this invention are likely to be useful in the transformation of a wide variety of plants, including 

20 any plant which may be infected by one or more strains of A. tumefaciens or A. rhizogenes. 



The NOS Promoter Region and 5' Non-translated Region 

25 The Applicants decided to obtain and utilize a nopaline synthase (NOS) promoter region to control the 
expression of the heterologous gene. The NOS is normally carried in certain types of Ti plasmids, such as 
PTiT37 (Sciaky et al, I978). The NOS promoter is normally inactive while in an A. tumefaciens cell. The 
entire NOS gene, including the promoter and the protein coding sequence, is withirTthe T-DNA portion of a 
Ti plasmid that is inserted into the chromosomes of plant cells when a plant becomes infected and forms a 

30 crown gall tumor. Once inside the plant cell, the NOS promoter region directs RNA polymerase within a 
plant cell to transcribe the NOS protein coding sequence into mRNA, which is subsequently translated into 
the NOS enzyme. 

The boundaries between the different parts of a promoter region (shown in Figure 1 as association 
region 2, intervening region 4, transcription initiation sequence 6, and intervening region 8), and the 

35 boundary between the promoter region and the 5' non-translated region, are not fully understood. The 
Applicants decided to utilize the entire promoter region and 5' non-translated region from the NOS gene, 
which is known to be expressed in plant cells. However, it is entirely possible that one or more of these 
sequences might be modified in various ways, such as alteration in length or replacement by other 
sequences. Such modifications in promoter regions and 5' non-translated regions have been studied in 

40 bacterial cells (see, e.g., Roberts et al 1979) and mammalian cells (see, e.g., McKnight, 1982). By utilizing 
the methodology taught by this invention, it is now possible to study the effects of modifications to promoter 
regions and 5' non-translated regions on the expression of genes in plant cells. It may be possible to 
increase the expression of a gene in a plant cell by means of such modifications. Such modifications, if 
performed upon chimeric genes of this invention, are within the scope of this invention. 

45 A nopaline-type tumor-inducing plasmid, designated as pTiT37, was isolated from a strain of A. 
tumefaciens using standard procedures (Currier and Nester, 1976). It was digested with the endonuclease 
Hindlll which produced numerous fragments. These fragments were separated by size on a gel, and one of 
the fragments was isolated and removed from the gel. This fragment was designated as the Hindlll-23 
fragment, because it was approximately the 23rd largest fragment from the Ti plasmid; it is approximately 

50 3400 base pairs (bp) in size, also referred to as 3.4 kilobases (kb). From work by others (see, e.g., 
Hernalsteens et al, 1980), it was known that the Hindlll-23 fragment contained the entire NOS gene, 
including the promoter region, a 5' non-translated region, a structural sequence with a start codon and a 
stop codon, and a 3' non-translated region. The Hindlll-23 fragment is shown in Figure 3. 

By means of various cleavage and sequencing experiments, it was determined that the Hindlll-23 

55 fragment could be digested by another endonuclease, Sau3a, to yield a fragment, about 350 bp in size, 
which contains the entire NOS-promoter region, the 5' non-translated region, and the first few codons of the 
NOS structural sequence. This fragment was sequenced, and the base sequence is represented in Figure 4. 
The start codon (ATG) of the NOS structural sequence begins at base pair 301 within the 350 bp fragment. 
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The Applicants decided to cleave the fragment between base pairs 300 and 301 ; this would provide them 
with a fragment about 300 base pairs long containing a NOS promoter region and the entire 5' non- 
translated region but with no translated bases. To cleave the 350 bp fragment at precisely the right location, 
the Applicants obtained an M13 clone designated as SIA, and utilized the procedure described below. 

5 To create the SIA clone, Dr. Michael Bevan of Washington University converted the 350 bp Sau3a 

fragment into a single strand of DNA. This was done by utilizing a virus vector, designated as the M13 mp2 
phage, which goes through both double-stranded (ds) and single-stranded (ss) stages in its life cycle 
(Messing et al, 1981). The ds 350 bp fragment was inserted into the double-stranded replicative form DNA 
of the M13 mp2, which had been cleaved with BamHI. The two fragments were ligated, and used to infect 

w E. coli cells. The ds DNA containing the 350 bp inserted fragment subsequently replicated, and one strand 
(theviral strand) was encapsulated by the M13 viral capsid proteins. In one clone, designated the SIA, the 
orientation of the 350 bp fragment was such that the anti-sense strand (containing the same sequence as 
the mRNA) of the NOS gene was carried in the viral strand. Viral particles released from infected cells were 
isolated, and provided to the Applicants. 

15 Single stranded SIA DNA, containing the anti-sense 350 bp fragment with the NOS promoter region, 
was isolated from the viral particles and sequenced. A 14-mer oligonucleotide primer was synthesized, 
using published procedures (Beaucage and Carruthers, 1981, as modified by Adams et al, 1982). This 14- 
mer was designed to be complementary to bases 287 through 300 of the 350 bp fragment, as shown on 
Figure 4. 

20 The 5' end of the synthetic primer was radioactively labelled with 32 P; this is represented in the figures 
by an asterisk. 

Copies of the primer were mixed with copies of the single-stranded SIA DNA containing the anti-sense 
strand of the 350 bp fragment. The primer annealed to the desired region of the SIA DNA, as shown at the 
top of Figure 5. After this occurred, Klenow DNA polymerase and a controlled quantity of unlabelled deoxy- 
25 nucleoside triphosphates (dNTP's), A, T, C, and G, were added. Klenow polymerase added nucleotides to 
the 3' (unlabelled) end of the primer, but not to the 5' (labelled) end. The result, as shown in Figure 5, was a 
circular loop of single-stranded DNA, part of which was matched by a second strand of DNA. The 5' end of 
the second strand was located opposite base #300 of the Sau3a insert. 

The partially double-stranded DNA was then digested by a third endonuclease, Haelll, which can cleave 
so both single-stranded and double-stranded DNA. Haelll cleavage sites were known to exist in several 
locations outside the 350 bp insert, but none existed inside the 350 bp insert. This created a fragment 
having one blunt end, and one 3' overhang which started at base #301 of the Sau3a insert. 

The Haelll fragment mixture was treated with T4 DNA polymerase and unlabelled dNTP's. This caused 
the single stranded portion of the DNA, which extended from base #301 of the Sau3a insert to the closest 
35 Haelll cleavage site, to be removed from the fragment. In this manner, the ATG start codon was removed 
from base pair #300, leaving a blunt end double-stranded fragment which was approximately 550 bp long. 

The mixture was then digested by a fourth endonuclease EcoRI, which cleaved the 550 bp fragment at 
a single site outside the NOS promoter region. The fragments were then separated by size on a gel, and 
t"o radioactively-labelled fragment was isolated. This fragment contained the entire NOS promoter region 
40 and 5' non-translated region. It had one blunt end with a sequence of 

5'-. . .CTGCA 
. . .GACGT 

45 

and one cohesive end (at the EcoRI site) with a sequence of 
5 ' AATTC- 

50 G- 

The shorter strand was about 308 bp long. 

The foregoing steps are represented in Figure 2 as steps 42, 44, and 46. 
55 This fragment was inserted into pMON40 (which is described below) to obtain pMON58, as shown on 
Figure 7. 
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Creation of plasmid with NPT II gene (pMON40) 

A bacterial transposon, designated as Tn5, is known to contain a complete NPT II gene, including the 
promoter region, structural sequence, and 3' non-translated region. The NPT II enzyme inactivates certain 
aminoglycoside antibiotics, such as kanamycin, neomycin, and G418; see Jimenez and Davies, 1980.. This 
gene is contained within a 1 .8 kb fragment, which can be obtained by digesting phage lambda bbkan-1 
DNA (D. Berg et al, 1975) with two endonucleases, Hindlll and BamHi. This fragment was inserted into a 
common laboratory plasmid, pBR327, which had been digested by Hindlll and BamHI. As shown in Figure 
6, the resulting plasmid was designated as pMON1001, which was about 4.7 kb. 

To reduce the size of the DNA fragment which carried the NPT II structural sequence, the Applicants 
eliminated about 500 bp from the pMON1001 plasmid, in the following manner. First, they digested 
pMON1001 at a unique Smal restriction site which was outside of the NPT II gene. Next, they inserted a 10- 
mer synthetic oligonucleotide linker, 

5» CCGGATCCGG 
GGCCTAGGCC 

into the Smal cleavage site. This eliminated the Smal cleavage site and replaced it with a BamHI cleavage 
site. A second BamHI cleavage site already existed, about 500 bp from the new BamHI site. The Applicants 
digested the plasmid with BamHI, separated the 500 bp fragment from the 4.2 kb fragment, and circularized 
the 4.2 kb fragment. The resulting plasmids were inserted into E. coli, which were then selected for 
resistance to ampicillin and kanamycin. A clonal colony of E. coli "was selected; these cells contained a 
plasmid which was designated as pMON40, as shown in Figure" 6~ 

The foregoing steps are represented in Figure 2 as steps 48 and 50. 



Insertion of NOS promoter into plasmid pMON40 

The Applicants deleted the NPT II promoter from pMON40. and replaced it with the NOS promoter 
fragment described previously, by the following method, shown on Figure 7. 

Previous cleavage and sequencing experiments (Rao and Rogers, 1979; Auerswald et al, 1980) 
indicated that a Bglll cleavage site existed in the NPT II gene between the promoter region and the 
structural sequence. Plasmid pMON40 was digested with Bglll. The cohesive ends were then filled in by 
mixing the cleaved plasmid with Klenow polymerase and the four dNTP's, to obtain the following blunt 
ends: 

5' - AGATC GATCT- 

- TCTAG CTAGA-5 ' 

The polymerase and dNTP's were removed, and the cleaved plasmid was then digested with EcoRI. The 
smaller fragment which contained the NPT II promoter region was removed, leaving a large fragment with 
one EcoRI end and one blunt end. This large fragment was mixed with the 308 bp fragment which 
contained the NOS promoter, described previously and shown on Figure 5. The fragments were ligated, and 
inserted into E. coli. E. coli clones were selected for ampicillin resistance. Replacement of the NPT II 
promoter region (a bacteriaTpromoter) with the NOS promoter region (which is believed to be active only in 
plant cells) caused the NPT II structural sequence to become inactive in E. coli. Plasmids from 36 
kanamycin-sensitive clones were obtained; the plasmid from one clone, designated as pMON58, was 
utilized in subsequent work. 

The foregoing steps are represented in Figure 2 as steps 52 and 54. 

Plasmid pMON58 may be digested to obtain a 1.3 kb EcoRI-BamHI fragment which contains the NOS 
promoter region, the NOS 5' non-translated region, and the NPT II structural sequence. This step is 
represented in Figure 2 as step 56. 



Insertion of NOS 3' sequence into NPT II gene 
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As mentioned above in "Background Art", the functions of 3' non-translated regions in eucaryotic genes 
are not fully understood. However, they are believed to contain at least one important sequence, a poly- 
adenylation signal. 

It was suspected by the Applicants that a gene having a bacterial 3' non-translated region might not be 
5 expressed as effectively in a plant cell as the same gene having a 3' non-translated region from a gene, 
such as NOS, which is known to be expressed in plants. Therefore, the Applicants decided to add a NOS 3' 
non-translated region to the chimeric gene, in addition to the NPT II 3' non-translated region already 
present. Alternately, it is possible, using the methods described herein, to delete the NPT II or other 
existing 3' non-translated region and replace it with a desired 3' non-translated region that is known to be 
70 expressed in plant cells. Whether a different type of 3' non-translated region (such as a 3' region from an 
octopine-type or agropine-type Ti plasmid, or a 3' region from a gene that normally exists in a plant cell) 
would be suitable or preferable for use in any particular type of chimeric gene, for use in any specific type 
of plant cell, may be determined by those skilled in the art through routine experimentation using the 
method of this invention. 

75 Those skilled in the art may also determine through routine experimentation whether the 3' non- 
translated region that naturally follows a structural sequence that is to be inserted into a plant ceil will 
enhance the efficient expression of that structural sequence in that type of plant cell. If so, then the steps 
required to insert a different 3' non-translated region into the chimeric gene might not be required in order 
to perform the method of this invention. 

20 In order to obtain a DNA fragment containing a NOS 3' non-translated region appropriate for joining to 
the NPT II structural sequence from pMON58 (described previously), the Applicants utilized a 3.4 kb Hindlll- 
23 fragment from a Ti plasmid, shown on Figure 3. This 3.4 kb fragment was isolated and digested with 
BamHI to obtain a 1.1 kb BamHI-Hindlll fragment containing a 3' portion of the NOS structural sequence 
(including the stop codon), and the 3' non-translated region of the NOS gene (including the poly-adenylation 

25 signal). This 1.1 kb fragment was inserted into a pBR327 plasmid which had been digested with Hindlll and 
BamHI. The resulting plasmid was designated as pMON42, as shown on Figure 8. 

Plasmid pMON42 was digested with BamHI and Rsal, and a 720 bp fragment containing the desired 
NOS 3' non-translated region was purified on a gel. The 720 bp fragment was digested with another 
endonuclease, Mbol, and treated with the large fragment of E coll DNA polymerase I. This resulted in a 260 

30 bp fragment with Mbol blunt ends, containing a large part ofthe~NOS 3' non-translated region including the 
poly-A signal. 

The foregoing procedure is represented in Figure 2 by step 58. However, it is recognized that alternate 
means could have been utilized; for example, it might have been possible to digest the Hindlll-23 fragment 
directly with Mbol to obtain the desired 260 bp fragment with the NOS 3' non-translated region. 

35 

Assembly of Chimeric Gene 

To complete the assembly of the chimeric gene, it was necessary to ligate the 260 bp Mbol fragment 
40 (which contained the NOS 3' non-translated region) to the 1.3 kb EcoRI-BamHI fragment from pMON58 
(which contained the NOS promoter region and 5' non-translated region and the NPT II structural 
sequence). In order to facilitate this ligation and control the orientation of the fragments, the Applicants 
decided to convert the Mbol ends of the 260 bp fragment into a BamHI end (at the 5' end of the fragment) 
and an EcoRI end (at the 3' end of the fragment). In order to perform this step, the Applicants used the 
45 following method. 

The 260 bp Mbol fragment, the termini of which had been converted to blunt ends by Klenow 
polymerase, was inserted into M13 mp8 DNA at a Smal cleavage site. The Smal site is surrounded by a 
variety of other cleavage sites present in the M13 mp8 DNA, as shown in Figure 8. The Mbol fragment 
could be inserted into the blunt Smal ends in either orientation. The orientation of the Mbol fragments in 
so different clones were tested, using Hinfl cleavage sites located assymetrically within the Mbol fragment. A 
clone was selected in which the 3' end of the NOS 3" non-translated region was located near the EcoRI 
cleavage site in the M13 mp8 DNA. This clone was designated as the M-2 clone, as shown in Figure 8. 

Replicative form (double stranded) DNA from the M-2 clone was digested by EcoRI and BamHI and a 
280 bp fragment was isolated. Separately, plasmid pMON58 was digested by EcoRI and BamHI, and a 
55 1 300 bp fragment was isolated. The two fragments were ligated, as shown in Figure 9, to complete the 
assembly of a NOS-NPTII-NOS chimeric gene having EcoRI ends. 

There are a variety of ways to control the ligation of the two fragments. For example, the two EcoRI- 
BamHI fragments could be joined together with DNA ligase and cleaved with EcoRI. After inactivation of 



15 



EP 0 131 623 B1 



EcoRI, a vector molecule having EcoRI ends that were treated with calf alkaline phosphatase (CAP) may be 
added to the mixture. The fragments in the mixture may be ligated in a variety of orientations. The plasmid 
mixture is used to transform E. coli, and cells having plasmids with the desired orientation are selected or 
screened, as described below. 

5 A plasmid, designated as pMON38, was created by insertion of the HindlII-23 fragment (from Ti plasmid 
pTiT37) into the Hindlll cleavage site of the plasmid pBR327. Plasmid pMON38 contains a unique EcoRI 
site, and an ampicillin-resistance gene which is expressed in E. coli. Plasmid pMON38 was cleaved with 
EcoRI and treated with alkaline phosphatase to prevent it frorrTre-ligating to itself. U.S. Patent 4,264,731 
(Shine, 1981). The resulting fragment was mixed with the 1300 bp NOS-NPTII fragment from pMON58, and 

w the 280 bp NOS fragment from M-2, which had been ligated and EcoRI-cleaved as described in the 
previous paragraph. The fragments were ligated, and inserted into E. coli. The E. coli cells which had 
acquired intact plasmids with ampicillin-resistance genes were selected on plateT containing ampicillin. 
Several clones were selected, and the orientation of the inserted chimeric genes was evaluated by means 
of cleavage experiments. Two clones having plasmids carrying NOS-NPT ll-NOS inserts with opposite 

15 orientations were selected and designated as pMON75 and pMON76, as shown in Figure 9. The chimeric 
gene may be isolated by digesting either pMON75 or pMON76 with EcoRI and purifying a 1580 bp 
fragment. 

The foregoing procedure is represented on Figure 2 by step 60. 

This completes the discussion of the NOS-NPTII-NOS chimeric gene. Additional information on the 
20 creation of this gene is provided in the Examples. A copy of this chimeric gene is contained in plasmid 
PMON128; it may be removed from pMON128 by digestion with EcoRI. A culture of E. coli containing 
pMON128 has been deposited with the American Type Culture Collection; this culture has been assigned 
accession number 39264. 

To prove the utility of this chimeric gene, the Applicants inserted it into plant cells. The NPTII structural 
25 sequence was expressed in the plant cells, causing them and their descendants to acquire resistance to 
concentrations of kanamycin which are normally toxic to plant cells. 



Creation of NPT I Chimeric Gene 

30 

In an alternate preferred embodiment of this invention, a chimeric gene was created comprising (1) a 
NOS promoter region and 5' non-translated region, [2) a structural sequence which codes for NPT I, and (3) 
a NOS 3' non-translated region. 

NPT I and NPT II are different and distinct enzymes with major differences in their amino acid 
35 sequences and substrate specificities. See, e.g., E. Beck et al, 1982. The relative stabilities and activities of 
these two enzymes in various types of plant cells are not yet fully understood, and NPT I may be preferable 
to NPT II for use in certain types of experiments and plant transformations. 

A 1200 bp fragment containing an entire NPT I gene was obtained by digesting pACY177 (Chang and 
Cohen, 1978) with the endonuclease, Avail. The Avail termini were converted to blunt ends with Klenow 
40 polymerase, and converted to BamHI termini using a synthetic linker. This fragment was inserted into a 
unique BamHI site in a pBR327-derived plasmid, as shown in Figure 11. The resulting plasmid was 
designated as pMON66. 

Plasmid pMON57 (a deletion derivative of pBR327, as shown in Figure 11) was digested with Avail. The 
225 bp fragment of pMON57 was replaced by the analogous 225 bp Avail fragment taken from plasmid 

45 pUC8 (Vieira and Messing, 1982), to obtain a derivative of pMON57 with no Pstl cleavage sites. This 
plasmid was designated as pMON67. 

Plasmid pMON58 (described previously and shown in Figure 7) was digested with EcoRI and BamHI to 
obtain a 1300 bp fragment carrying the NOS promoter and the NPT II structural sequence. This fragment 
was inserted into pMON67 which had been digested with EcoRI and BamHI. The resulting plasmid was 

so designated as pMON73, as shown in Figure 12. 

pMON73 was digested with Pstl and BamHI, and a 2.4 kb fragment was isolated containing a NOS 
promoter region and 5' non-translated region. Plasmid pMON66 (shown on Figure 11) was digested with 
Xhol and BamHI to yield a 950 bp fragment containing the structural sequence of NPT I. This fragment 
lacked about 30 nucleotides at the 5' end of the structural sequence. A synthetic linker containing the 

55 missing bases, having appropriate Pstl and Xhol ends, was created. The pMON73 fragment, the pMON66 
fragment, and the synthetic linker were ligated together to obtain plasmid pMON78, as shown in Figure 13. 
This plasmid contains the NOS promoter region and 5' non-translated region joined to the NPT I structural 
sequence. The ATG start codon was in the same position that the ATG start codon of the NOS structural 
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sequence had occupied. 

Plasmid pMON78 was digested with EcoRI and BamHI to yield a 1300 bp fragment carrying the 
chimeric NOS-NPT I regions. Doyle-stranded DNA from the M-2 clone (described previously and shown on 
Figure 9) was digested with EcoRI and BamHI, to yield a 280 bp fragment carrying a NOS 3' non-translated 
5 region with a poly-adenylation signal. The two fragments described above were ligated together to create 
the NOS-NPT l-NOS chimeric gene, which was inserted into plasmid pMON38 (described above) which had 
been digested with EcoRI. The two resulting plasmids, having chimeric gene inserts with opposite 
orientations, were designated as pMON106 and pMON107, as shown in Figure 14. 

Either of plasmids pMON106 or pMON107 may be digested with EcoRI to yield a 1.6 kb fragment 
70 containing the chimeric NOS-NPT l-NOS gene. This fragment was inserted into plasmid pMON120 which 
had been digested with EcoRI and treated with alkaline phosphatase. The resulting plasmids, having inserts 
with opposite orientations, were designated as pMON130 and pMON131, as shown on Figure 15. 

The NOS-NPT l-NOS chimeric gene was inserted into plant cells, which acquired resistance to 
kanamycin. This demonstrates expression of the chimeric gene in plant cells. 

15 

Creation of Chimeric Gene with Soybean Promoter 

In an alternate preferred embodiment of this invention, a chimeric gene was created comprising (1) a 
20 promoter region and 5' non-translated region taken from a gene which naturally exists in soybean; this gene 
codes for the small subunit of ribulose-1 ,5-bis -phosphate carboxylase (sbss, for soybean small subunit); (2) 
a structural sequence which codes for NPTlTand (3) a NOS 3' non-translated region. 

The sbss gene codes for a protein in soybean leaves which is involved in photosynthetic carbon 
fixation. The sbss protein is the most abundant protein in soybean leaves (accounting for about 10% of the 
25 total leaf protein), so it is likely that the sbss promoter region causes prolific transcription. 

There are believed to be approximately six genes encoding the sbss protein in the soybean genome. 
One of the members of the sbss gene family, SRS1, which is highly transcribed in soybean leaves, has 
been cloned and characterized. The promoter region, 5' nontranslated region, and a portion of the structural 
sequence are contained on a 2.1 kb EcoRI fragment that was subcloned into the EcoRI site of plasmid 
30 pBR325 (Bolivar, 1978). The resultant plasmid, pSRS2.1, was a gift to Monsanto Company from Dr. R. B. 
Meagher, University of Georgia, Athens, CA. The 2.1 kb EcoRI fragment from pSRS2.1 is shown on Figure 
16. 

Plasmid pSRS2.1 was prepared from dam-E. coji cells, and cleaved with Mbol to obtain an 800 bp 
fragment. This fragment was inserted into plasmid pKC7 (Rao and Rogers, 1979) which had been cleaved 

35 with Bglll. The resulting plasmid was designated as pMON121, as shown on Figure 17. 

Plasmid pMON121 was digested with EcoRI and Bell, and a 1200 bp fragment containing the sbss 
promoter region was isolated. Separately, plasmid pMON75 (described previously and shown on Figure 9) 
was digested with EcoRI and Bglll, and a 1250 bp fragment was isolated, containing a NPT II structural 
sequence and a NOS 3' non-translated region. The two fragments were ligated at the compatible Bcll/Bglll 

40 overhangs, to create a 2450 bp fragment containing sbss-NPT ll-NOS chimeric gene. This fragment was 
inserted into pMON120 which had been cleaved with EcoRI, to create two plasmids having chimeric gene 
inserts with opposite orientations, as shown in Figure 18. The plasmids were designated as pMON141 and 
PMON142. 

The sbss-NPTII-NOS chimeric genes were inserted into several types of plant cells, causing the plant 
45 cells to acquire resistance to kanamycin. 

This successful transformation proved that a promoter region from one type of plant can cause the 
expression of a gene within plant cells from an entirely different genus, family, and order of plants. 

The chimeric sbss-NPT ll-NOS gene also had another significant feature. Sequencing experiments 
indicated that the 800 bp Mbol fragment contained the ATG start codon of the sbss structural sequence. 
50 Rather than remove this start codon, the Applicants decided to insert a stop codon behind it in the same 
reading frame. This created a dicistronic mRNA sequence, which coded for a truncated amino portion of the 
sbss polypeptide and a complete NPT II polypeptide. Expression of the NPT II polypeptide was the first 
proof that a dicistronic mRNA can be translated within plant cells. 

The sbss promoter is contained in plasmid pMON154, described below. A culture of E. coli containing 
55 this plasmid has been deposited with the American Type Culture Collection. This culture has been assigned 
accession number 39265. 
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Creation of BGH Chimeric Genes 

In an alternate preferred embodiment of this invention, a chimeric gene was created comprising (1) a 
sbss promoter region and 5' non-translated region, (2) a structural sequence which codes for bovine growth 
hormone (BGH) and (3) a NOS 3' non-translated region. This chimeric gene was created as follows. 

A structural sequence which codes for the polypeptide, bovine growth hormone, (see, e.g., Woychik et 
al, 1982) was inserted into a pBR322-derived plasmid. The resulting plasmid was designated as plasmid 
CF-1 . This plasmid was digested with EcoRI and Hindlll to yield a 570 bp fragment containing the structural 
sequence. Double stranded M-2 RF DNA (described previously and shown in Figure 8) was cleaved with 
EcoRI and Hindlll to yield a 290 bp fragment which contained the NOS 3' non-translated region with a poly- 
adenylation signal. The two fragments were ligated together and digested with EcoRI to create an 860 base 
pair fragment with EcoRI ends, which contained a BGH-coding structural sequence joined to the NOS 3' 
non-translated region. This fragment was introduced into plasmid pMON38, which had been digested with 
EcoRI and treated with alkaline phosphatase, to create a new plasmid, designated as pMON 108, as shown 
in Figure 19. 

A unique Bglll restruction site was introduced at the 5' end of the BGH structural sequence by digesting 
pMON 108 with EcoRI to obtain the 860 bp fragment, and using Klenow polymerase to create blunt ends on 
the resulting EcoRI fragment. This fragment was ligated into plasmid N25 (a derivative of pBR327 
containing a synthetic linker carrying Bglll and Xbal cleavage sites inserted at the BamHI site), which had 
been cleaved with Xbal and treated with Klenow polymerase to obtain blunt ends (N25 contains a unique 
Bglll site located 12 bases from the Xbal site). The resulting plasmid, which contained the 860bp BGH-NOS 
fragment in the orientation shown in Figure 20, was designated as plasmid N25-BGH. This plasmid contains 
a unique Bglll cleavage site located about 25 bases from the 5' end of the BGH structural sequence. 

Plasmid N25-BGH prepared from dam- E. coli cells was digested with Bglll and Clal to yield an 860 bp 
fragment which contained the BGH structural sequence joined to the NOS 3' non-translated region. 
Separately, plasmid pMON121 (described previously and shown in Figure 17) was prepared from dam- E. 
coli cells and was digested with Clal and Bell to create an 1100 bp fragment which contained the sbss 
promoter region. The fragments were ligated at their compatible Bcll/Bglll overhangs, and digested with Clal 
to yield a Clal fragment of about 2 kb containing the chimeric sbss-BGH-NOS gene. This fragment was 
inserted into pMON120 (described previously and shown in Figure 10) which had been digested with Clal. 
The resulting plasmids, containing the inserted chimeric gene in opposite orientations were designated 
pMON147 and pMON148, as shown in Figure 21. 

An alternate chimeric BGH gene was created which contained (1) a NOS promoter region and 5' non- 
translated region, (2) a structural sequence which codes for BGH, and (3) a NOS 3' non-translated region, 
by the following method, shown in Figure 22. 

Plasmid pMON76 (described above and shown in Figure 9) was digested with EcoRI and Bglll to obtain 
a 308 bp fragment containing a NOS promoter region and 5' non-translated region. Plasmid N25-BGH 
prepared from dam- E. coli cells (described above and shown in Figure 20) was digested with Bglll and Clal 
to obtain a 900 bp fragment containing a BGH structural sequence and a NOS 3' non-translated region. 
These two fragments were ligated together to obtain a chimeric NOS-BGH-NOS gene in a fragment with 
EcoRI and Clal ends. This fragment was ligated with an 8 kb fragment obtained by digesting pMON120 with 
EcoRI and Clal. The resulting plasmid, designated as pMON149, is shown in Figure 22. 



Creation of Chimeric NOS-EPSP-NOS Gene 

In an alternate preferred embodiment, a chimeric gene was created comprising (1) a NOS promoter 
region and 5' non-translated region, (2) a structural sequence which codes for the E. coli enzyme, 5-enol 
pyruvyl shikimate-3-phosphoric acid synthase (EPSP synthase) and (3) a NOS 3' nonTra~nsTated region. 

EPSP synthase is believed to be the target enzyme for the herbicide, glyphosate, which is marketed by 
Monsanto Company under the registered trademark, "Roundup." Glyphosate is known to inhibit EPSP 
synthase activity (Amrhein et al, 1980), and amplification of the EPSP synthase gene in bacteria is known to 
increase their resistance to glyphosate. Therefore, increasing the level of EPSP synthase activity in plants 
may confer resistance to glyphosate in transformed plants. Since glyphosate is toxic to most plants, this 
provides for a useful method of weed control. Seeds of a desired crop plant which has been transformed to 
increase EPSP synthase activity may be planted in a field. Glyphosate may be applied to the field at 
concentrations which will kill all non-transformed plants, leaving the non-transformed plants unharmed. 

An EPSP synthase gene may be isolated by a variety of means, including the following. A lambda 



18 



EP 0 131 623 B1 



phage library may be created which carries a variety of DNA fragments produced by Hindill cleavage of E. 

coli DNA. See, e.g., Maniatis et al, 1982. 

The EPSP synthase gene is one of the genes which are involved in the production of aromatic amino 

acids. These genes are designated as the "aro" genes; EPSP synthase is designated as aroA . Cells which 
5 do not contain functional aro genes are designated as aro- cells. Aro- cells must normally be grown on 

media supplemented by aromatic amino acids. See Pittard and Wallis, 1966. 

Different lambda phages which carry various Hindill fragments may be used to infect mutant E. coli 

cells which do not have EPSP synthase genes. The infected aro- cells may be cultured on media which 

does not contain the aromatic amino acids, and transformed aro+ clones which are capable of growing on 
70 such media may be selected. Such clones are likely to contain the EPSP synthase gene. Phage particles 

may be isolated from such clones, and DNA may be isolated from these phages. The phage DNA may be 

cleaved with one or more restriction endonucleases, and by a gradual process of analysis, a fragment which 

contains the EPSP synthase gene may be isolated. 

Using a procedure similar to the method summarized above, the Applicants isolated an 11 kb Hindill 
75 fragment which contained the entire E. coli EPSP synthase gene. This fragment was digested with Bglll to 

produce a 3.5 kb Hindlll-Bglll fragmenFwhich contained the entire EPSP synthase gene. This 3.5 kb 

fragment was inserted into plasmid pkC7 (Rao and Rogers, 1979) to produce plasmid pMON4, whch is 

shown in Figure 23. 

Plasmid pMON4 was digested with Clal to yield a 2.5 kb fragment which contained the EPSP synthase 
20 structural sequence. This fragment was inserted into pBR327 that had been digested with Clal, to create 
pMON8, as shown in Figure 23. 

pMON8 was digested with BamHI and Ndel to obtain a 4.9 kb fragment. This fragment lacked about 
200 nucleotides encoding the amino terminus of the EPSP synthase structural sequence. 

The missing nucleotides were replaced by ligating a Hinfl/Ndel fragment, obtained from pMONS as 
25 shown in Figure 24, together with a synthetic oligonucleotide sequence containing (1) the EPSP synthase 
start codon and the first three nucleotides, (2) a unique Bglll site, and (3) the appropriate BamHI and Hinfl 
ends. The resulting plasmid, pMON25, contains an intact EPSP synthase structural sequence with unique 
BamHI and Bglll sites positioned near the start codon. 

Double stranded M-2 DNA (described previously and shown in Figure 8) was digested with Hindill and 
30 EcoRI to yield a 290 bp fragment which contains the NOS 3' non-translated region and poly-adenylation 
signal. This fragment was introduced into a pMON25 plasmid that had been digested with EcoRI and Hindill 
to create a plasmid, designated as pMON146 (shown in Figure 25) which contains the EPSP structural 
sequence joined to the NOS 3' non-translated region. 

pMON146 was cleaved with Clal and Bglll to yield a 2.3 kb fragment carrying the EPSP structural 
35 sequence joined to the NOS 3' non-translated region. pMON76 (described previously and shown in Figure 
9) was digested with Bglll and EcoRI to create a 310 bp fragment containing the NOS promoter region and 
5' non-translated region. The above fragments were mixed with pMON120 (described previously and shown 
in Figure 10) that had been digested with Clal and EcoRI, and the mixture was ligated. The resulting 
plasmid, designated pMON153, is shown in Figure 26. This plasmid contains the chimeric NOS-EPSP-NOS 
40 gene. 

A plasmid containing a chimeric sbss-EPSP-NOS gene was prepared in the following manner, shown in 
Figure 27. Plasmid pMON146 (described previously and shown in Figure 25) was digested with Clal and 
Bglll, and a 2.3 kb fragment was purified. This fragment contained the EPSP synthase structural sequence 
coupled to a NOS 3' non-translated region with a poly-adenylation signal. Plasmid pMON121 (described 

45 above and shown in Figure 17) was digested with Clal and Bell, and a 1.1 kb fragment was purified. This 
fragment contains an sbss promoter region and 5' non-translated region. The two fragments were mixed 
and ligated with T4 DNA ligase and subsequently digested with Clal. This created a chimeric sbss-EPSP- 
NOS gene, joined through compatible Bglll and Bell termini. This chimeric gene with Clal termini was 
inserted into plasmid pMON120 which had been digested with Clal and treated with calf alkaline 

50 phosphatase (CAP). The mixture was ligated with T4 DNA ligase. The resulting mixture of fragments and 
plasmids was used to transform E. coli ceils, which were selected for resistance to spectinomycin. A colony 
of resistant cells was isolated, arTdThe plasmid in this colony was designated as pMON154, as shown in 
Figure 27. 

A culture of E. coli containing pMON154 has been deposited with the American Type Culture Center. 
55 This culture has beeTTassigned accession number 39265. 



Creation of CaMV(19S)-NPT ll-NOS Genes 
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In an alternate preferred embodiment of this invention, a chimeric gene was created which contained 
the following elements: 

1 . a promoter region and a 5' non-translated region derived from the CaM V (1 9S) gene, which codes for 
the P66 protein, 

5 2. a partial coding sequence from the CaMV (19S) genes including an ATG start codon and several 
internal ATG sequences, all of which were in the same frame as a TGA termination sequence immediately 
inside the desired ATG start codon of the NPTII gene; 

3. a structural sequence derived from a neomycin phosphotransferase II (NPT II) gene; this sequence 
was preceded by a spurious ATG sequence, which was in the same reading frame as a TGA sequence 

w within the NPT II structural sequence; and, 

4. A 3' non-translated region, including a poly-adenylation signal, derived from a nopaline synthase 
(NOS) gene. 

This chimeric gene, referred to herein as the CaMV(19S)-NPTII-NOS gene, was inserted into plasmid 
pMON120 to create a plasmid designated as pMON156, shown in Figure 29 and described in Example 9. 
75 Plasmid pMON156 was inserted in A. tumefaciens cells, where it formed a co-integrate Ti plasmid by 
means of a single crossover event with a Ti plasmid in the A. tumefaciens cell. The chimeric gene in the 
co-integrate plasmid was within a modified T-DNA region in the Ti plasmid, surrounded by left and right T- 
DNA borders. 

A similar chimeric gene was created and assembled in a plasmid designated as pMON155, shown in 
20 Figure 32 and described in Example 10. This chimeric gene resembled the gene in pMON156, with two 
exceptions: 

1. an oligonucleotide linker having stop codons in ali three reading frames was inserted between the 
CaMV(19S) partial structural sequence and the NPT II structural sequence; and, 

2. the spurious ATG sequence on the 5' side of the NPT II structural sequence was deleted. 

25 The construction of this chimeric gene is described in Example 10. This gene was inserted into A. 
tumefaciens cells and subsequently into plant cells. Its level of expression was apparently higher than the 
expression of the similar gene in pMON156, as assayed by growth on higher concentrations of kanamycin. 
A. tumefaciens cells containing co-integrate Ti::pMON155 plasmids have been deposited with the American 
Type Culture Center, and have been assigned ATCC accession number 39336. 



Creation of Chimeric CaMV(32S)-NPT ll-NOS Genes 

In an alternate preferred embodiment of this invention, a chimeric gene was created comprising 
35 (1) a promoter region which causes transcription of the 32S CaMV mRNA; 

(2) a structural sequence which codes for NPT II; and 

(3) a NOS 3' non-translated region. 

The assembly of this chimeric gene is described in Example 11 and Figures 33 through 37. This gene 
was inserted into plant cells and it caused them to become resistant to kanamycin. 
40 Petunia plants cannot normally be infected by CaMV. Those skilled in the art may determine through 
routine experimentation whether any particular plant viral promoter (such as the CaMV promoter) will 
function at satisfactory levels in any particular type of plant cell, including plant cells that are outside of the 
normal host range of the virus from which the promoter was derived. 

45 

Means for Inserting Chimeric Genes Into Plant Cells 

A variety of methods are known for inserting foreign DNA into plant cells. One such method, utilized by 
the Applicants, involved inserting a chimeric gene into Ti plasmids carried by A. tumefaciens, and co- 

50 cultivating the A. tumefaciens cells with plants. A segment of T-DNA carrying The chimeric gene was 
transferred into the plant genome, causing transformation. This method is described in detail in two 
separate U.S. patent applications entitled "Plasmids for Transforming Plant Cells," serial number 458,411, 
(WO84/02919) and "Genetically Transformed Plants," serial number 458,402, (WO84/02920) both of which 
were filed on January 17, 1983. 

55 A variety of other methods are listed below. These methods are theoretically capable of inserting the 
chimeric genes of this invention into plant cells, although the reported transformation efficiencies achieved 
to date by such methods have been low. The chimeric genes of this invention (especially those chimeric 
genes such as NPT I and NPT II, which may be utilized as selectable markers) are likely to facilitate 
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research on methods of inserting DNA into plants or plant cells. 

1. One alternate technique for inserting DNA into plant cells involves the use of lipid vesicles, also 
called liposomes. Liposomes may be utilized to encapsulate one or more DNA molecules. The liposomes 
and their DNA contents may be taken up by plant cells; see, e.g., Lurquin, 1981. If the inserted DNA can be 

5 incorporated into the plant genome, replicated, and inherited, the plant cells will be transformed. 

To date, efforts to use liposomes to deliver DNA into plant cells have not met with great success 
(Fraley and Papahadjopoulos, 1981). Only relatively small DNA molecules have been transferred into plant 
cells by means of liposomes, and none have yet been expressed. However, liposome-delivery technology is 
still being actively developed, and it is likely that methods will be developed for transferring plasmids 

w containing the chimeric genes of this invention into plant cells by means involving liposomes. 

2. Other alternate techniques involve contacting plant cells with DNA which is complexed with either (a) 
polycationic substances, such as poly-L-ornithine (Davey et al, 1980), or (b) calcium phosphate (Krens et al, 
1982). Although efficiencies of transformation achieved to date have been low, these methods are still being 
actively researched. 

15 3. A method has been developed involving the fusion of bacteria, which contain desired plasmids, with 
plant cells. Such methods involve converting the bacteria into spheroplasts and converting the plant cells 
into protoplasts. Both of these methods remove the cell wall barrier from the bacterial and plant cells, using 
enzymic digestion. The two cell types can then be fused together by exposure to chemical agents, such as 
polyethylene glycol. See Hasezawa et al, 1981. Although the transformation efficiencies achieved to date by 

20 this method have been low, similar experiments using fusions of bacterial and animal cells have produced 
good results; see Rassoulzadegan et al, 1982. 

4. Two other methods which have been used successfully to genetically transform animal cells involve 
(a) direct microinjection of DNA into animal cells, using very small glass needles (Capecchi, 1980), and (b) 
electric-current-induced uptake of DNA by animal cells (Wong and Neumann, 1982). Although neither of 

25 these techniques have been utilized to date to transform plant cells, they may be useful to insert chimeric 
genes of this invention into plant cells. 



Meaning of Various Phrases 

A variety of phrases which are used in the claims must be defined and described to clarify the meaning 
and coverage of the claims. 

The meaning of any particular term shall be interpreted with reference to the text and figures of this 
application. In particular, it is recognized that a variety of terms have developed which are used 

35 inconsistently in the literature. For example, a variety of meanings have evolved for the term "promoter," 
some of which include the 5' non-translated region and some of which do not. In an effort to avoid problems 
of interpretation, the Applicants have attempted to define various terms. However, such definitions are not 
presumed or intended to be comprehensive and they shall be interpreted in light of the relevant literature. 
The term "chimeric gene" refers to a gene that contains at least two portions that were derived from 

40 different and distinct genes. As used herein, this term is limited to genes which have been assembled, 
synthesized, or otherwise produced as a result of man-made efforts, and any genes which are replicated or 
otherwise derived therefrom. "Man-made efforts" include enzymatic, cellular, and other biological pro- 
cesses, if such processes occur under conditions which are caused, enhanced, or controlled by human 
effort or intervention; this excludes genes which are created solely by natural processes. 

45 As used herein, a "gene" is limited to a segment of DNA which is normally regarded as a gene by 
those skilled in the art. For example, a plasmid might contain a plant-derived promoter region and a 
heterologous structural sequence, but unless those two segments are positioned with respect to each other 
in the plasmid such that the promoter region causes the transcription of the structural sequence, then those 
two segments would not be regarded as included in the same gene. 

so This invention relates to chimeric genes which have structural sequences that are "heterologous" with 
respect to their promoter regions. This includes at least two types of chimeric genes: 

1 . DNA of a gene which is foreign to a plant cell. For example, if a structural sequence which codes for 
mammalian protein or bacterial protein is coupled to a plant promoter region, such a gene would be 
regarded as heterologous. 

55 2. A plant cell gene which is naturally promoted by a different plant promoter region. For example, if a 
structural sequence which codes for a plant protein is normally controlled by a low-quantity promoter, the 
structural sequence may be coupled with a prolific promoter. This might cause a higher quantity of 
transcription of the structural sequence, thereby leading to plants with higher protein content. Such a 
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structural sequence would be regarded as heterologous with regard to the prolific promoter. 

However, it is not essential for this invention that the entire structural sequence be heterologous with 
respect to the entire promoter region. For example, a chimeric gene of this invention may be created which 
would be translated into a "fusion protein", i.e., a protein comprising polypeptide portions derived from two 
5 separate structural sequences. This may be accomplished by inserting all or part of a heterologous 
structural sequence into the structural sequence of a plant gene, somewhere after the start codon of the 
plant structural sequence. 

As used herein, the phrase, "a promoter region derived from a specified gene" shall include a promoter 
region if one or more parts of the promoter region were derived from the specified gene. For example, it 

10 might be discovered that one or more portions of a particular plant-derived promoter region (such as 
intervening region 8, shown on Figure 1) might be replaced by one or more sequences derived from a 
different gene, such as the gene that contains the heterologous structural sequence, without reducing the 
expression of the resulting chimeric gene in a particular type of host cell. Such a chimeric gene would 
contain a plant-derived association region 2, intervening region 4, and transcription initiation sequence 6, 

75 followed by heterologous intervening region 8, 5' non-translated region 10 and structural sequence 14. Such 
a chimeric gene is within the scope of this invention. 

As used herein, the phrase "derived from" shall be construed broadly. For example, a structural 
sequence may be "derived from" a particular gene by a variety of processes, including the following: 

1 . the gene may be reproduced by various means such as inserting it into a plasmid and replicating the 
20 plasmid by cell culturing, in vitro replication, or other methods, and the desired sequence may be obtained 

from the DNA copies by various means such as endonuclease digestion; 

2. mRNA which was coded for by the gene may be obtained and processed in various ways, such as 
preparing complementary DNA from the mRNA and then digesting the cDNA with endonucleases; 

3. the sequence of bases in the structural sequence may be determined by various methods, such as 
25 endonuclease mapping or the Maxam-Gilbert method. A strand of DNA which duplicates or approximates 

the desired sequence may be created by various methods, such as chemical synthesis or ligation of 
oligonucleotide fragments. 

4. a structural sequence of bases may be deduced by applying the genetic code to the sequence of 
amino acid residues in a polypeptide. Usually, a variety of DNA structural sequences may be determined 

30 for any polypeptide, because of the redundancy of the genetic code. From this variety, a desired sequence 
of bases may be selected, and a strand of DNA having the selected sequence may be created. 

If desired, any DNA sequence may be modified by substituting certain bases for the existing bases. 
Such modifications may be performed for a variety of reasons. For example, one or more bases in a 
sequence may be replaced by other bases in order to create or delete a cleavage site for a particular 

35 endonuclease. As another example, one or more bases in a sequence may be replaced in order to reduce 
the occurrence of "stem and loop" structures in messenger RNA. Such modified sequences are within the 
scope of this invention. 

A structural sequence may contain introns and exons; such a structural sequence may be derived from 
DNA, or from an mRNA primary transcript. Alternately, a structural sequence may be derived from 
40 processed mRNA, from which one or more introns have been deleted. 

The Applicants have deposited two cultures of E. coli cells containing plasmids pMON128 and 
PMON154 with the American Type Culture Collection (ATCC). These cells have been assigned ATCC 
accession numbers 39264 and 39265, respectively. 

Those skilled in the art will recognize, or be able to ascertain using no more than routine experimenta- 
45 tion, numerous equivalents to the specific embodiments described herein. Such equivalents are within the 
scope of this invention. 



EXAMPLES 

50 

Example 1 Creation of pMON1001 

Fifty micrograms (ug) of lambda phage bbkan-1 DNA (Berg et al, 1975) were digested with 100 units of 
55 Hindlll (all restriction endonucleases were obtained from New England Biolabs, Beverly, MA, and were used 
with buffers according to the suppliers instructions, unless otherwise specified) for 2 hr at 37° C. After heat- 
inactivation (70° C, 10 min), the 3.3 kb Tn5 Hindlll fragment was purified on a sucrose gradient. One ug of 
the purified Hindlll fragment was digested with BamHI (2 units, 1 hr, 37° C), to create a 1.8 kb fragment. 
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The endonuclease was heat inactivated. 

Plasmid pBR327 (Soberon et al, 1981), 1 ug, was digested with Hindlll and BamHI (2 units each, 2 hr, 
37° C). Following digestion, the endonucleases were heat inactivated and the cleaved pBR327 DNA was 
added to the BamHI-Hindlll Tn5 fragments. After addition of ATP to a concentration of 0.75mM, 10 units of 
T4 DNA ligase (prepared by the method of Murray et al, 1979) was added, and the reaction was allowed to 
continue for 16 hours at 12-14° C. One unit of T4 DNA ligase will give 90% circularization of one ug of 
Hindlll-cleaved pBR327 plasmid in 5 minutes at 22° C. 

The ligated DNA was used to transform CaCI 2 -shocked E. coli CB00 rec A56 cells (Maniatis et al, 
1982). After expression in Luria broth (LB) for 1 hour at 37° C _ the _ cells weTe" spread on solid LB media 
plates containing 200 ug/ml ampicillin and 40 ug/ml kanamycin. Following 16 hour incubation at 37° C, 
several hundred colonies appeared. Plasmid mini-prep DNA was prepared from six of these. (Ish-Horowicz 
and Burke, 1981). Endonuclease digestion showed that all six of the plasmids carried the 1.8 kb Hindlll- 
BamHI fragment. One of those isolates was designated as pMON1001 as shown in Figure 6. 



Example 2: Creation of pMON40 

Five ug of plasmid pMON1001 (described in Example 1) was digested with Smal. The reaction was 
terminated by phenol extraction, and the DNA was precipitated by ethanol. A BamHI linker CCGGATCCGG 
(0.1 ug), which had been phosphorylated with ATP and T4 polynucleotide kinase (Bethesda Research 
Laboratory, Rockville, MD) was added to 1 ug of the pMON1001 fragment. The mixture was treated with T4 
DNA ligase (100 units) for 18 hours at 14° C. After heating at 70° C for 10 min to inactivate the DNA ligase, 
the DNA mixture was digested with BamHI endonuclease (20 units, 3 hours, 37° C) and separated by 
electrophoresis on an 0.5% agarose gel. The band corresponding to the 4.2 kb Smal-BamHI vector 
fragment was excised from the gel. The 4.2 kb fragment was purified by adsorption on glass beads 
(Vogelstein and Gillespie, 1979), ethanol precipitated and resuspended in 20 ul of DNA ligase buffer with 
ATP. T4 DNA ligase (20 units) was added and the mixture was incubated for 1 .5 hours at room temperature. 
The DNA was mixed with rubidium chloride-shocked E. coli C600 cells for DNA transformation (Maniatis et 
al, 1982). After expression for 1 hour at 37 °C in LB, the cells were spread on LB plates containing 200 
ug/ml of ampicillin and 20 ug/ml kanamycin. The plates were incubated at 37° C for 16 hours. Twelve 
ampicillin-resistant, kanamycin-resistant colonies were chosen, 2 ml cultures were grown, and mini-plasmid 
preparations were performed. Endonuclease mapping of the plasmids revealed that ten of the twelve 
contained no Smal site and a single BamHI site, and were of the appropriate size, 4.2 kb. The plasmid from 
one of the ten colonies was designated as pMON40, as shown in Figure 6. 



Example 3: Creation of NOS Promoter Fragment 

An oligonucleotide with the following sequence, 5'-TGCAGATTATTTGG-3', was synthesized (Beaucage 
and Carruthers, 1981, as modified by Adams et al, 1982). This oligonucleotide contained a 32 P radioactive 
label, which was added to the 5' thymidine residue by polynucleotide kinase. 

An M13 mp7 derivative, designated as SIA, was given to Applicants by M. Bevan and M.-D. Chilton, 
Washington University, St. Louis, MO. To the best of Applicants' knowledge and belief, the SIA DNA was 
obtained by the following method. A pTiT37 piasmid was digested with Hindlll, and a 3.4 kb fragment was 
isolated and designated as the Hindlll-23 fragment. This fragment was digested with Sau3a, to create a 344 
bp fragment with Sau3a ends. This fragment was inserted into double-stranded, replicative form DNA from 
the M13 mp7 phage vector (Messing et al, 1981) which had been cut with BamHI. Two recombinant phages 
with 344 bp inserts resulted, one of which contained the anti-sense strand of the NOS promoter fragment. 
That recombinant phage was designated as SIA, and a clonal copy was given to the Applicants. 

The Applicants prepared the single-stranded form of the SIA DNA (14.4 ug; 6 pmol), and annealed it (10 
minutes at 70° C, then cooled to room temperature) with 20 pmol of the 14-mer oligonucleotide, mentioned 
above. The oligonucleotide annealed to the Sau3a insert at bases 286-300 as shown on Figures 4 and 5. 

200 ul of the SIA template and annealed oligonucleotide were mixed with the four dNTP's (present at a 
final concentration of 1mM, 25 ul) and 50 ul of Klenow polymerase. The mixture incubated for 30 minutes at 
room temperature. During this period, the polymerase added dNTP's to the 3' end of the oligonucleotide. 
The polymerase was heat-inactivated (70° C, 3 min), and Haelll (160 units) were added. The mixture was 
incubated (1 hour, 55° C), the Haelll was inactivated (70° C, 3 min), and the four dNTP's (1mM, 12 ul) and 
T4 DNA polymerase (50 units) were added. The mixture was incubated (1 hour, 37° C) and the polymerase 
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was inactivated (70° C, 3 min). This yielded a fragment of about 570 bp. EcoRI (150 units) was added, the 
mixture was incubated (1 hour, 37° C) and the EcoRI was inactivated (70° C, 3 min). 

Aliquots of the mixture were separated on 6% polyacrylamide with 25% glycerol. Autoradiography 
revealed a radioactively labelled band about 310 bp in size. This band was excised. The foregoing 
procedure is indicated by Figure 5. 



Example 4: Creation of pMON58 

Five ug of plasmid pMON40 (described in Example 2) were digested with Bglli (10 units, 1.5 hour, 
37° C), and the Bglll was inactivated (70° C, 10 min). The four dNTP's (1mM, 5 ui) and Klenow polymerase 
(8 units) were added, the mixture was incubated (37° C, 40 min), and the polymerase was inactivated 
(70° C, 10 min). EcoRI (10 units) was added and incubated (1 hour, 37° C), and calf alkaline phosphatase 
(CAP) was added and incubated (1 hour, 37° C). A fragment of about 3.9 kb was purified on agarose gel 
using NA-45 membrane (Scheicher and Scheull, Keene NH). The fragment (1.0 pM) was mixed with the 
NOS promoter fragment (0.1 pM), described in Example 3, and with T4 DNA ligase (100 units). The mixture 
was incubated (4° C, 16 hr). The resulting plasmids were inserted into E. coli cells, which were selected on 
media containing 200 ug/ml ampicillin. Thirty-six clonal Amp R colonieTwere selected, and mini-preps of 
plasmids were made from those colonies. The plasmid from one colony demonstrated a 308 bp EcoRI-Bglll 
fragment, a new Sstll cleavage site carried by the 308 bp NOS fragment, and a new Pstl site. This plasmid 
was designated as pMON58, as shown in Figure 7. pMON58 DNA was prepared as described above. 



Example 5: Creation of PMON42 

Plasmid pBR325-Hindlll-23, a derivative of plasmid pBR325 (Bolivar, 1978) carrying the Hindlll-23 
fragment of pTIT37 (see Figure 3) in the Hindlll site, was given to Applicants by M. Bevan and M.-D. 
Chilton, Washington University, St. Louis, MO. DNA of this plasmid was prepared and 30 ug were digested 
with Hindlll (50 units) and BamHI (50 units). The 1.1 kb Hindlll-BamHI fragment was purified by adsorption 
on glass beads (Vogelstein and Gillespie, 1979) after agarose gel electrophoresis. The purified fragment 
(0.5 ug) was added to 0.5 ug^of the 2.9 kb Hindlll-BamHI fragment of pBR327. After treatment with DNA 
ligase (20 units, 4 hours, 22° C), the resulting plasmids were introduced to E. coli C600 ceils. Clones 
resistant to ampicillin at 200 ug/ml were selected on solid media; 220 clones wereobtained. Minipreps of 
plasmid DNA were made from six of these clones and tested with the presence of a 1.1 kb fragment after 
digestion with Hindlll and BamHI. One plasmid which demonstrated the correct insert was designated 
pMON42. Plasmid pMON42 DNA was prepared as described in previous examples. 



Example 6: Creation of M13 Clone M-2 

Seventy-five ug of plasmid pMON42 (described in Example 5) prepared from dam- E. coli cells were 
digested with Rsal and BamHI (50 units of each, 3 hours, 37° C) and the 720 bp Rsal-BamHilrigment was 
purified using NA-45 membrane. Eight ug of the purified 720 bp BamHI-Rsal fragment were digested with 
Mbol (10 min, 70° C), the ends were made blunt by filling in with the large Klenow fragment of DNA 
polymerase I and the four dNTP's. Then 0.1 ug of the resulting DNA mixture was added to 0.05 ug of M13 
mp8 previously digested with Smal (1 unit, 1 hour 37° C) and calf alkaline phosphatase (0.2 units). After 
ligation (10 units of T4 DNA ligase, 16 hours, 12° C) and transfection of E. coli JM101 cells, several hundred 
recombinant phage were obtained. Duplex RF DNA was prepared from "twelve recombinant, phage-carrying 
clones. The RE DNA (0.1 ug) was cleaved with EcoRI, (1 unit, 1 hour, 37° C), end-labeled with 32 P-dATP 
and Klenow polymerase, and re-digested with BamHI (I unit, 1 hour, 37° C). The EcoRI and BamHI sites 
span the Smal site. Therefore, clones containing the 260 bp Mbol fragment could be identified as yielding a 
labelled 270 bp fragment after electrophoresis on 6% polyacrylamide gels and autoradiography. Four of the 
twelve clones carried this fragment. The orientation of the insert was determined by digestion of the EcoRI- 
cleaved, end-labeled RF DNA (0.1 ug) with Hinfl (1 unit, 1 hour, 37° C). Hinfl cleaves the 260 bp Mbol 
fragment once 99 bp from the 3' end of the fragment and again 42 bp from the end nearest the NOS 
coding region. Two clones of each orientation were obtained. One clone, digested as M-2 as shown in 
Figure 8, contained the 260 bp fragment with the EcoRI site at the 3' end of the fragment. M-2 RF DNA was 
prepared using the procedures of Messing, et al 1981. 
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Example 7: Creation of pMON75 and pMON76 

Fifty ug of M-2 RF DNA (described in Example 6) were digested with 50 units of EcoRI and 50 units of 
BamHI for 2 hours at 37° C. The 270 bp fragment (1 ug) was purified using agarose gel and NA-45 

5 membrane. Plasmid pMON58 (described in Example 4) was digested with EcoRI and BamHI (50 ug, 50 
units each, 2 hours, 37° C) and the 1300 bp fragment was purified using NA-45 membrane. The 270 bp 
EcoRI-BamHI (0.1 ug) and 1300 bp EcoRI-BamHI (0.5 ug) fragments were mixed, treated with T4 DNA 
ligase (2 units) for 12 hours at 14° C. After heating at 70° C for 10 minutes to inactivate the ligase, the 
mixture was treated with EcoRI (10 units) for 1 hour at 37° C, then heated to 70 °C for 10 minutes to 

10 inactivate the EcoRI. This completed the assembly of a chimeric NOS-NPT ll-NOS gene on a 1.6 kb 
fragment, as shown on Figure 9. 

Plasmid pMON38 is a clone of the pTiT37 Hindlll-23 fragment inserted in the Hindlll site of pBR327 
(Soberon et al, 1980). pMON38 DNA (20 ug) was digested with EcoRI (20 units, 2 hours, 37° C) and calf 
alkaline phosphatase (0.2 units, 1 hour, 37° C). The pMON38 DNA reaction was extracted with phenol, 

75 precipitated with ethanol, dried and resuspended in 20 ul of 10 mM Tris-HCI, 1 mM EDTA, pH 8. 



0.2 ug of the cleaved pMON38 DNA was added to the chimeric gene mixture described above. The mixture 
was treated with T4 DNA ligase (4 units, 1 hour, 22° C) and mixed with Rb chloride-treated E. coli C600 rec 

20 A56 cells to obtain transformation. After plating with selection for ampicillin-resistant (200 ug/ml) colonies, 
63 potential candidates were obtained. Alkaline mini-preps of plasmid DNA were made from 12 of these and 
screened by restriction endonuclease digestion for the proper constructs. Plasmid DNA's that contained a 
1.5 kb EcoRI fragment and a new Bglll site were digested with BamHI to determine the orientation of the 
1.5 kb EcoRI fragment. One of each insert orientation was picked. One plasmid was designated pMON75 

25 and the other pMON76, as shown in Figure 9. DNA from these plasmids were prepared as described in 
previous examples. 



Example 8: Creation of plasmids PMON128 and pMON129 

30 

The 1.5 kb EcoRI fragment was excised by EcoRI digestion from either pMON75 or pMON76 and 
purified after agarose gel electrophoresis as described in previous examples. Five ug of DNA from plasmid 
pMON120 (described in a separate application, "Plasmids for Transforming Plant Cells," (WO84/0291 9) 
cited previously) was digested with EcoRI and treated with calf alkaline phosphatase. After phenol 

35 deproteinization and ethanol precipitation, the EcoRI-cleaved pMON120 linear DNA was mixed with 0.5 ug 
of the 1 .5 kb EcoRI chimeric gene fragment. The mixture was treated with 2 units of T4 DNA ligase for 1 
hour at 22oC. After transformation of E. coli cells (Maniatis et al, 1982) and selection of colonies resistant to 
spectinomycin (50 ug/ml), several thousand colonies appeared. Six of these were picked, grown, and 
plasmid mini-preps made. The plasmid DNA's were digested with EcoRI to check for the 1 .5 kb chimeric 

40 gene insert and with BamHI to determine the orientation of the insert. BamHI digestion showed that in 
pMON128 the chimeric gene was transcribed in the same direction as the intact nopaline synthase gene of 
pMON120. The orientation of the insert in pMON129 was opposite that in pMON128; the appearance of an 
additional 1.5 kb BamHI fragment in digests of pMON129 showed that plasmid pMON129 carried a tandem 
duplication of the chimeric NOS-NPT ll-NOS gene, as shown in Figure 10. 

45 

Example 9: Creation of Plasmid pMON156 

Plasmids which contained CaMV DNA were a gift to Monsanto Company from Dr. R. J. Shepherd, 
50 University of California, Davis. To the best of Applicants' knowledge and belief, these plasmids (designated 
as pOSI) were obtained by inserting the entire genome of a CaMV strain designated as CM4-184 (Howarth 
et 1, 1981) into the Sal I restriction site of a pBR322 plasmid (Bolivar et al, 1977). E. coli cells transformed 
with pOSI were resistant to ampicillin (Amp R ) and sensitive to tetracycline (Tet s ). 

Various strains of CaMV suitable for isolation of CaMV DNA which can be used in this invention are 
55 publicly available; see, e.g., ATCC Catalogue of Strains II, p. 387 (3rd edition, 1981). 

pOSI DNA was cleaved with Hindlll. Three small fragments were purified after electrophoresis on an 
0.8% agarose gel using NA-45 membrane (Schleicher and Schuell, Keene NH). The smallest fragment, 
about 500 bp in size, contains the 19S promoter. This fragment was further purified on a 6% acrylamide 
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gel. After various manipulations which did not change the sequence of this fragment (shown in Figure 28), it 
was digested with Mbol to create a 455 bp Hindlll-Mbol fragment. This fragment was mixed with a 1250 bp 
fragment obtained by digesting pMON75 (described in Example 7 and shown in Figure 9) with Bglll and 
EcoRI. This fragment contains the NPTII structural sequence and the NOS 3' non-translated region. The two 
5 fragments were ligated together by their compatible Mbol and Bglll overhangs to create a fragment 
containing the CaMV(19S)-NPTII-NOS chimeric gene. This fragment was inserted into pMON120 which had 
been cleaved with Hindlll and EcoRI. The resulting plasmid was designated as pMON156, as shown in 
Figure 29. 

Plasmid pMON156 was inserted into E. coli cells and subsequently into A. tumefaciens cells where it 
w formed a co-integrate Ti plasmid having the CaMV(19S)-NPT ll-NOS chimericTgene surrounded by T-DNA 
borders. A. tumefaciens cells containing the co-integrate plasmids were co-cultivated with petunia cells. The 
co-cultivated petunia cells were cultured on media containing kanamycin. Some of the co-cultivated petunia 
cells survived and produced colonies on media containing up to 50 ug/ml kanamycin. This indicated that the 
CaMV(19S)-NPT ll-NOS genes were expressed in petunia cells. These results were confirmed by Southern 
75 blot analysis of transformed plant cell DNA. 



Example 10: Creation of pMON155 

20 Plasmid pMON72 was obtained by inserting a 1.8 kb Hindlll-BamHI fragment from bacterial transposon 
Tn5 (which contains an NPTII structural sequence) into a Pstl- pBR327 plasmid digested with Hindlll and 
BamHI. This plasmid was digested with Bglll and Pstl to remove the NPTII structural sequence. 

Plasmid pMON1001 (described in Example 1 and shown in Figure 6) from dam- cells was digested with 
Bglll and Pstl to obtain a 218 bp fragment with a partial NPTII structural sequence. This fragment was 

25 digested with Mbol to obtain a 194 bp fragment. 

A triple ligation was performed using (a) the large Pstl-Bglll fragment of pMON72; (b) the Pstl-Mbol 
fragment from pMON1001; and (c) a synthetic linker with Bglll and Mbol ends having stop codons in all 
three reading frames. After transformation of E. coli cells and selection for ampicillin resistant colonies, 
plasmid DNA from Amp R colonies was analyzed Acolony containing a plasmid with the desired structure 

30 was identified. This plasmid was designated pMON110, as shown on Figure 30. 

In order to add the 3' end of the NPT II structural sequence to the 5' portion in pMON110, pMON110 
was treated with Xhol. The resulting overhanging end was filled in to create a blunt end by treatment with 
Klenow polymerase and the four deoxy-nucleotide triphosphates (dNTP's), A, T, C, and G. The Klenow 
polymerase was inactivated by heat, the fragment was digested with Pstl, and a 3.6 kb fragment was 

35 purified. Plasmid pMON76 (described in Example 7 and shown in Figure 9) was digested with Hindlll, filled 
in to create a blunt end with Klenow polymerase and the four dNTP's, and digested with Pstl. An 1100 bp 
fragment was purified, which contained part of the NPT II structural sequence, and a nopaline synthase 
(NOS) 3' non-translated region. This fragment was ligated with the 3.6 kb fragment from pMON110. The 
mixture was used to transform E. coli cells; Amp R cells were selected, and a colony having a plasmid with 

40 the desired structure was identified? This plasmid was designated pMON132, as shown on Figure 31. 
Plasmid pMON93 (shown on Figure 28) was digested with Hindlll, and a 476 bp fragment was isolated. This 
fragment was digested with Mbol, and a 455 bp fragment was purified which contained the CaMV (19S) 
promoter region and 5' non-translated region. Plasmid pMON132 was digested with EcoRI and Bglll to 
obtain a 1250 bp fragment with (1) the synthetic linker equipped with stop codons in all three reading 

45 frames; (2) the NPT II structural sequence; and (3) the NOS 3' non-translated region. These two fragments 
were joined together through the compatible Mbol and Bglll ends to create a CaMV (19S)-NPT ll-NOS 
chimeric gene. 

This gene was inserted into pMON120, which was digested with Hindlll and EcoRI, to create plasmid 
pMON155, as shown in Figure 32. 

so Plasmid pMON155 was inserted into A. tumefaciens GV3111 cells containing a Ti plasmid, pTiB653. 
The pMON155 plasmid formed a cointegrate plasmid with the Ti plasmid by means of a single crossover 
event. Cells which contain this co-integrate plasmid have been deposited with the American Type Culture 
Collection, and have been assigned ATCC accession number 39336. A fragment which contains the 
chimeric gene of this invention can be obtained by digesting the co-integrate plasmid with Hindlll and 

55 EcoRI, and purifying the 1.7 kb fragment. These cells have been used to transform petunia cells, allowing 
the petunia cells to grow on media containing at least 100 ug/ml kanamycin. 
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Example 11: Creation of PMON183 and 184 

Plasmid pOSI (described in Example 9) was digested with Bglll, and 1200 bp fragment was purified. 
This fragment contained the 32S promoter region and part of the 5' non-translated region. It was inserted 
s into plasmid pSHL72 which had been digested with BamHI and Bglll (pSHL72 is functionally equivalent to 
pAG060, described in Colbere-Garapin et al, 1981). The resulting plasmid was designated as pMON50, as 
shown on Figure 33. 

The cloned Bglll fragment contains a region of DNA that acts as a polyadenylation site for the 32S RNA 
transcript. This polyadenylation region was removed as follows: pMON50 was digested with Avail and an 

w 1100 bp fragment was purified. This fragment was digested with EcoRI* and EcoRV. The resulting 190 bp 
EcoRI-EcoRV fragment was purified and inserted into plasmid pBR327, which had been digested with 
EcoRI and EcoRV. The resulting plasmid, pMON81, contains the CaMV 32S promoter on a 190 bp EcoRV- 
EcoRI fragment, as shown on Figure 33. 

To make certain the entire promoter region of CaMV(32S) was present in pMON81, a region adjacent to 

15 the 5' (EcoRV) end of the fragment was inserted into pMON81 in the following way. Plasmid pMON50 
prepared from dam- cells was digested with EcoRI and Bglll and the resultant 1550 bp fragment was 
purified and digested with Mbol. The resulting 725 bp Mbol fragment was purified and inserted into the 
unique Bglll site of plasmid pKC7 (Rao and Rogers, 1979) to give plasmid pMON125, as shown in Figure 
34. The sequence of bases adjacent to the two Mbol ends regenerates Bglll sites and allows the 725 bp 

20 fragment to be excised with Bglll. 

To generate a fragment carrying the 32S promoter, the 725 bp Bglll fragment was purified from 
pMON125 and was subsequently digested with EcoRV and Aiul to yield a 190 bp fragment. Plasmid 
pMON81 was digested with BamHI, treated with Klenow polymerase and digested with EcoRV. The 3.1 kb 
EcoRV-BamHI(blunt) fragment was purified, mixed with the 190 bp EcoRV-Alul fragment and treated with 

25 DNA ligase. Following transformation and selection of ampicillin-resistant cells, plasmid pMON172 was 
obtained which carries the CaMV(32S) promoter sequence on a 380 bp BamHI-EcoRI fragment, as shown 
on Figure 35. This fragment does not carry the poly-adenylation region for the 32S RNA. Ligation of the Alul 
end to the filled-in BamHI site regenerates the BamHI site. 

To rearrange the restriction endonuclease sites adjacent to the CaMV(32S) promoter, the 380 bp 

30 BamHI-EcoRI fragment was purified from pMON172, treated with Klenow polymerase, and inserted into the 
unique Smal site of phage M13 mp8. One recombinant phage, M12, carried the 380 bp fragment in the 
orientation shown on Figure 36. The replicative form DNA from this phage carries the 32S promoter 
fragment on an EcoRI(5')-BamHI(3') fragment. 

Plasmids carrying a chimeric gene (CaMV(32S) promoter region-NPT II structural sequence-NOS 3' 

35 non-translated region) were assembled as follows. The 380 bp EcoRI-BamHI CaMV (32S) promoter 
fragment was purified from phage M12 RF DNA and mixed with the 1250 bp Bglll-EcoRI NPT ll-NOS 
fragment from pMON75. Joining of these two fragments through their compatible BamHI and Bglll ends 
results in a 1.6 kb CaMV(32S)-NPT il-NOS chimeric gene. This gene was inserted into pMON120 at the 
EcoRI site in both orientations. The resultant plasmids, pMON183 and 184, appear in Figure 37. These 

40 plasmids were used to transform petunia cells. The transformed cells are capable of growth on media 
containing 100 ug/ml kanamycin. 
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45 Claims 

1. A chimeric gene capable of expressing a polypeptide in plant cells comprising in sequence: 

(a) a promoter region from a gene which is naturally expressed in plant cells; 

(b) a 5' non-translated region; 

50 (c) a structural coding sequence encoding a neomycin phosphotransferase polypeptide; and 

(d) a 3' non-translated region of a gene naturally expressed in plant cells, said region encoding a 
signal sequence for polyadenylation of mRNA; said promoter being heterologous with respect to the 
structural coding sequence. 

55 2. A gene of Claim 1 in which the promoter is selected from a gene of the group consisting of a nopaline 
synthase gene and a ribulose -1 ,5-bis-phosphate carboxylase small subunit gene. 

3. A gene of Claim 1 in which the 3' non-translated region is selected from a gene from the group 
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consisting of the genes from the T-DNA region of Agrobacterium tumefaciens. 

4. A gene of Claim 1 or 2 in which the 3' non-translated region is from the nopaline synthase gene of 
Agrobacterium tumefaciens. 

5. A chimeric gene capable of expressing a polypeptide in plant cells comprising in sequence: 

(a) a promoter region from a plant virus; 

(b) a 5' non-translated region; 

(c) a structural coding sequence; 

(d) a 3' non-translated region of a gene naturally expressed in plants, said region encoding a signal 
sequence for polyadenylation of mRNA. said structural coding sequence being heterologous with 
respect to said promoter region. 

6. A gene of claim 5 in which the promoter is from cauliflower mosaic virus. 

7. A gene of claim 6 in which the 3' non-translated region is from a nopaline synthase gene. 

8. A gene of claim 5 in which the promoter is the full-length transcript promoter of cauliflower mosaic 
virus. 

9. A gene of claim 8 in which the 3' non-translated region is from a nopaline synthase gene. 

10. A culture of microorganisms identified by ATCC accession number 39265. 



Revendications 

1. Gene chimerique capable d'exprimer un polypeptide dans des cellules vegetales, comprenant succes- 
sivement : 

(a) une region promotrice provenant d'un gene qui est naturellement exprime dans des cellules 
vegetales; 

(b) une region non traduite 5'; 

(c) une sequence de codage structurale codant un polypeptide, la neomycine phosphotransferase; et 

(d) une region non traduite 3' d'un gene exprime naturellement dans les cellules vegetales, cette 
region codant une sequence de signal pour la polyadenylation de I'ARNm; 

ce promoteur etant heterologue par rapport a la sequence de codage structurale. 

2. Gene selon la revendication 1, dans lequel le promoteur est choisi parmi le gene de la nopaline 
synthase et un gene qui est une petite sousunite de ribulose -1 ,5-bis-phosphatee carboxylase. 

3. Gene selon la revendication 1 , dans lequel la region non traduite 3' est choisie parmi les genes de la 
region ADN-T d'Agrobacterium tumefaciens. 

4. Gene selon les revendications 1 ou 2, dans lequel la region non traduite 3' provient du gene de la 
nopaline synthase d'Agrobacterium tumefaciens. 

5. Gene chimerique capable d'exprimer un polypeptide dans des cellules vegetales, comprenant succes- 
sivement : 

(a) une region promotrice d'un virus de plante; 

(b) une region non traduite 5'; 

(c) une sequence de codage structurale; 

(d) une region non traduite 3' d'un gene exprime naturellement dans les plantes, cette region codant 
une sequence de signal pour la poly-adenylation de I'ARNm; cette sequence de codage structurale 
.etant heterologue par rapport a cette region promotrice. 

6. Gene selon la revendication 5, dans lequel le promoteur est le virus de la mosaique du choux-fleur. 

7. Gene selon la revendication 6, dans lequel la region non traduite 3' provient d'u, gene de la nopaline 
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synthase. 

8. Gene selon la revendication 5, dans lequel le promoteur est le promoteur de transcription en pleine 
longueur du virus de la mosa'fque du choux-fleur. 

9. Gene selon la revendication 8, dans lequel la region non traduite 3' provient d'un gene de la nopaline 
synthase. 

10. Culture de microorganismes, identified par le numero d'inscription ATCC 39265. 



Anspriiche 

1. Chimares Gen, das ein Polypeptid in Pflanzenzellen exprimieren kann und in Sequenz umfafit: 

(a) einen Promotorbereich von einem Gen, das in Pflanzenzellen naturlich exprimiert wird; 

(b) einen 5'-nicht-ubersetzten Bereich; 

(c) eine strukturelle Kodiersequenz, die ein Neomycin-Phosphotransferase-Polypeptid kodiert; und 

(d) einen 3'-nicht-ubersetzten Bereich eines in Pflanzenzellen naturlich exprimierten Gens, welcher 
Bereich eine Signalsequenz fur Polyadenylierung von mRNA kodiert; wobei der Promotor in bezug 
auf die strukturelle Kodiersequenz heterolog ist. 

2. Gen nach Anspruch 1 , in dem der Promotor gewShlt ist von einem Gen der Gruppe bestehend aus 
einem Nopalin-Synthase-Gen und einem Ribulose-1 ,5-biphosphat-Carboxylase-Gen kleiner Subeinheit. 

3. Gen nach Anspruch 1, in dem der 3'-nicht-ubersetzte Bereich gewahlt ist von einem Gen der Gruppe 
bestehend aus den Genen vom T-DNA-Bereich von Agrobacterium tumefaciens. 

4. Gen nach Anspruch 1 oder 2, in dem der 3'-nicht-ubersetzte Bereich vom Nopalin-Synthase-Gen von 
Agrobacterium tumefaciens ist. 

5. Chimares Gen, das ein Polypeptid in Pflanzenzellen exprimieren kann und in Sequenz umfaflt: 

(a) einen Promotorbereich von einem Pflanzenvirus; 

(b) einen 5'-nicht-ubersetzten Bereich; 

(c) eine strukturelle Kodiersequenz und 

(d) einen 3'-nicht-ubersetzten Bereich eines in Pflanzenzellen naturlich exprimierten Gens, welcher 
Bereich eine Signalsequenz fur Polyadenylierung von mRNA kodiert; wobei die strukturelle Kodier- 
sequenz in bezug auf den Promotorbereich heterolog ist. 

6. Gen nach Anspruch 5, in dem der Promotor vom Blumenkohlmosaikvirus ist. 

7. Gen nach Anspruch 6, in dem der 3'-nicht-ubersetzte Bereich von einem Nopalin-Synthase-Gen ist. 

8. Gen nach Anspruch 5, in dem der Promotor der Transkriptpromotor in voller GroBe vom Blumenkohl- 
mosaikvirus ist. 

9. Gen nach Anspruch 8, in dem der 3'-nicht-ubersetzte Bereich von einem Nopalin-Synthase-Gen ist. 

10. Kultur von Mikroorganismen identifiziert durch die ATCC Nr. 39265. 
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ISOLATE FRAGMENT CONTAINING 
{ NOS) PLANT PROMOTER REGION, 
5 ' NON -TRANSLATED REGION AND 
START OF STRUCTURAL SEQUENCE 



ISOLATE FRAGMENT CONTAINING 
HETEROLOGOUS ( NPT- 1 1 ) 
STRUCTURAL SEQUENCE 



44 REMOVE STRUCTURAL SEQUENCE 50 
FROM FRAGMENT 



INSERT FRAGMENT INTO PLASMID 
WITH CLEAVAGE SITE NEAR 
START CODON (pM0N40) 



ISOLATE FRAGMENT WITH (NOS) 
PLANT PROMETER REGION ANO 
5' NON-TRANSLATED REGION 



CLEAVE PLASMID AT SITE 
NEAR HETEROLOGOUS START 
CODON 



INSERT (NOS) PLANT PROMOTER FRAGMENT INTO PLASMID 
(pMON 40); RELIGATE; SELECT CELLS WITH FRAGMENT 
IN PROPER ORIENTATION {pMON 58) 
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CLEAVE PLASMIO ( pMON 58 ), 
ISOLATE FRAGMENT CONTAINING 
(NOS) PLANT PROMOTER REGION 
AND (NPT II) HETEROLOGOUS 
STRUCTURAL SEQUENCE 



ISOLATE FRAGMENT CONTAINING 
3' NON-TRANSLATED REGION OF 
PLANT (NOS) GENE 
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LIGATE FRAGMENTS TOGETHER IN PROPER 
ORIENTATION TO OBTAIN CHIMERIC 
(NOS-NPT ll-NOS)GENE 



INSERT CHIMERIC GENE INTO SELECTED 
PLASMID IpMONIZO) TO OBTAIN CHIMERIC 
PLASMID (pMON I28, pMON I29) 

Z . v 



INSERT CHIMERIC PLASMID 
INTO PLANT CELLS 



IDENTIFY TRANSFORMED 
PLANT CELLS 



INSERT CHIMERIC PLASMID 
INTO A, TUMEFACIENS TO 
CREATE CO-INTEGRATE Tl 
PLASMID 



INFECT PLANT CELLS WITH 
A, TUMEFACIENS 



IDENTI FY TRANSFORMED 
PLANT CELLS 



FIG. 2. 



32 



EP 0 131 623 B1 




33 



EP 0 131 623 B1 



o< 



O CO 
lO o 

~ < 



o»- 
oo 

NO 



too 



/ 
O < I- 
Oo o 
woo 

K < 
O O 
H < 

5Si| 8 



<2 



O -a 

So" 1 . 

zccz 
°q:< 
o ujir 

zo« 
Pz 

oOi 
lOZI- 

= i 

tn 3 



o 0 



o< 

- O 

o 



L. XS ^o 

i_ <n o 



oo 

CD H 

— o 



o< 

TO 
M O 



CM q 



°5 
CVJ < 



o< t oo 

J< H *0 

So O ^O 

O O k 

2 "« 5 

O H- 



2f- 



25 



o o 


o< 


O t- 


o< 


— h- 

O 


US o 

-o 


~< 


o o 

«M O 


< 


< 






o 


< 


< 


< 


o 


< 


< 


1- 


o 


o 


< 


o 


< 


< 


O 


o 


o 




o 




o 




< 





C 



cr lu 
oo 

HZ 
OUi 
DO 

ceo 



ceo 

<2 

► HO 
coo 



34 



EP 0 131 623 B1 




35 



EP 0 131 623 B1 




EP 0 131 623 B1 




37 



EP 0 131 623 B1 




EP 0 131 623 B1 




DIGEST WITH EcoRI AND 
Bom HI PURIFY I 300 bp 
FRAGMENT 



(SEE FIG.8.) 



DIGEST WITH EcoRI 
AND Bam HI PURIFY 
280bp FRAGMENT 



EcoRI 



Bgl II 



BamHI Bam HI 



EcoRI 



NOS PROMOTER NPT 1 1 STRUCTURAL NOS 3 ' NON - TRANSLATED 
REGION ANO 5' SEQUENCE AND REGION 
NON-TRANSLATED 3 ' NON -TRANSLATED 
REGION REGION 



EcoRI 




pMON 75 
8.5 kb 



NOS 

pMON 76 
8.5 kb 



FIG. 9. 
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NOS PROMOTER, 5' 
NON -TRANSLATED 
EcoRI REGIONS 
, Pst I 



Amp" 
pMON 73 



' STRUCTURAL \ 
SEQUENCE 
pMON 66 



DIGEST Pst I, 

Bom HI PURIFY 2.4Kb 

FRAGMENT 



, EcoRI 



DIGEST Xho I , Bom HI 
PURIFY 950 bp 
FRAGMENT 



-CTGCA-3' SYNTHETIC LINKER 

PROMOTER ^ ^* ' 
>'NON-TRANS- / ATGAGCCATATTCAACGGAAACGTCTTGC 
I LATED 3'- A C GTTA CTCGGTATA AGTTG CCTTTGCAGA ACG AGCT- 5 ' 

RE<3,0NS .BomH.^ XholN^ 

-G 5-GATCC C ' * 

-CCTAG-5' G GAGCT-5' 



MIX, LIGATE, TRANSFORM, 
SELECT Amp R CELLS 



PARTIAL NPT I 
STRUCTURAL 
SEQUENCE 




NOS 
PROMOTER, 
' 5' NON -TRANSLATED 



SYNTHETIC 
y LINKER 

• Xho I 



NPT I 
STRUCTURAL^ 
SEQUENCE - 



FIG. 13. 
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NOS 

POLY-A SIGNAL 



DIGEST WITH EcoRI, 
Bam HI PURIFY I300bp 
FRAGMENT 



EcoRI EcoRI 




DIGEST WITH EcoRI, 
Bam HI PURIFY 280bp 
FRAGMENT 



Xhol BamHI BamHI EcoRI 



V 



NOS NPT I 

PROMOTER STRUCTURAL 
5'NONTRANS- SEQUENCE NOS 3' 
LATED REGIONS NONTRANSLATED 
REGION 



FIG. 14 
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EcoRI 




MIX, LIGATE TRANSFORM, 
SELECT Spc* CELLS 




FIG, 15, 
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EcoRI 



S0Y8EAN SMALL SU8UNIT 
PROMOTER FRAGMENT 





DIGEST WITH Mbo I 
PURIFY 800 bp 
FRAGMENT 



Hind 111 Mbol 



Sbss PROMOTER 
5' NONTRANSLATED 
REGIONS 




MIX, LIG ATE.TRANSFORM, 
SELECT Am pR CELLS 



Hind III 
Mbo 1/ Bgl II JOINT 



Bel I 

FIG. 17. 
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NOS 3' 

NONTRANS- 
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EcoRI 




MIX, LIGATEJRANSFORM, 
SELECT Amp" CELLS 




Bgl II 
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DIGEST WITH Nde I, 
BstE II 
PURIFY 



DIGEST WITH 
Nd«l,8amHI 
PURIFY 4.9Kb 
FRAGMENT 
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EcoRI 





Bgl II 



DIGEST WITH EcoRI, 
Hind III 
PURIFY 290 bp 
FRAGMENT 



Bgl II 



NOS 3'NONTRANSLATED 
REGION 



MIX, LIGATE.TRANSFORM, 
SELECT Amp" CELLS 



FIG. 25. 




pMON I46 EPSP , 

SYNTHASE/ 
STRUCTURAL / 
SEQUENCE / 



-Bgl II 
Bo m H I 
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8gl II 



DIGEST EcoR I, 
Bql II 

PURIFY I250bp 
FRAGMENT 



EcoR I 




POLY A 
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BQl II 




Bglll 



SPURIOUS 
AT 6 
-Mboi 



STRUCT. 
SEQ. 
pMON IOOI 
FROM dom 3 
CELLS 



Bom HI 



P*tl 

Xhol 
DIGEST Pst I, 8gl II 
PURIFY 3.4 Kb 
FRAGMENT 



PARTIAL NPT II 

STRUCTURAL 

SEQUENCE 




DIGEST Bgl II, Pst I 
PURIFY 2l8bp FRAGMENT 



G ATC TAGT TAGTTA ATCTAGAC Mbo I 
ATC A ATCAATTAGATCTGCTAG 

SYNTHETIC 5 > h 

LINKER 



TAG^J 



-C 

- GCTAG 



DIGEST Mbo I 
PURIFY 194 bp 
FRAGMENT 



Xb0 , .-—SYNTHETIC LINKER 
Mbo I 
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Bqlll 



Bam HI 

NOS 
POLY-A 
SIGNAL 
DIGEST Eco Rl 
Bglll 

PURIFY I250bp 
FRAGMENT 




FIG. 32. 



pMON 1 55 



Spc/ 



Sac I 



I9S /Str 
k PROMOTER wn . 
> REGIONS N0 .? 
MODIFIED 
NPTII STRUCT. ZQ 
SEQ. ' 



'Bgl II 
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EcoRI 




FRAGMENT 
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Bg! II 



ScoRX 




DIGEST Bam HI 
FILL-IN WITH KLENOW 
POLYMERASE + 4dNTPs 



OIGEST EcoRTT 
PURIFY 3.1 Kb 
FRAGMENT 



89I II 



EcoRT 1 Alu I 



Bglll 

=1 




DIGEST EcoRTT 
Alu I 

PURIFY I90bp 
FRAGMENT 

Alu I 



FIG. 35. 



MIX.LlGATE, 
TRANSFORM, SELECT 
Amp* CELLS 



>Amp R 

CoMV 32S^ 
PROMOTER, 
5' NON- 
TRANSLATED ) 
REGIONS 

pMON 172 
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DIGEST EcoRI, 

Bam HI 
FILL-IN ENDS WITH 
KLENOW POLYMERASE 
+ 4d NTPs 
PURIFY 380 bp 
FRAGMENT 



MIX.LIGATE, 
TRANSFORM, SELECT 
Amp " CELLS 





DIGEST Smo I, CAP 



FIG. 36. 
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